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PREFACE 


This report describes a new automated process that pioneers full-scale 
operational use of subject switching by the NASA Scientific and Technical 
Information (STI) Facility. The subject switching process routinely 
translates machine-readable subject terms from one controlled vocabulary 
into the equivalent terms of another controlled vocabulary. To do subject 
switching, we use a system called the NASA Lexical Dictionary (NLD). The 
report also describes the NLD, how to build a lexical dictionary system, 
what resources are needed, and how to maintain the system after it's built. 
The experience of the NASA STI Facility with their fiist input vocabulary, 
that of the Defense Technical Information Center (DTIC), is included in the 
section labeled HISTORY. 

We would like to acknowledge the help given to this project by 
personnel at DTIC. Without their cooperation the construction of the NLD 
would have been more difficult and costly. 

Work on the NLD, done by Planning Research Corporation/Government 
Information Systems, was supported by the National Aeronautics and Space 
Administration's Scientific and Technical Information Branch under contract 
NASw-3330. The period of performance covertd by this report is from 
November 2, 1981 to December 31, 1983. 


EXECUTIVE SUMMARY 

The NASA Lexical Dictionary (NLD) , a system that automatically 
translates input subject terms to those of NASA, was developed in four 
phases. Phase One provided Phrase Matching, a context sensitive 
word-matching process that matches input phrase words with any NASA 
Thesaurus posting (i.e. index) term or Use reference. Other Use references 
have been added to enable the matching of synonyms, variant spellings, and 
some words with the same root. Phase Two provided the capability of 
translating any individual DTIC term to one or more NASA terms having the 
same meaning. Phase Three provided NASA terms having equivalent concepts 
for two or more DTIC terms, i.e. coordinations of DTIC terms. Phase Four 
was concerned with indexer feedback and maintenance. Although the original 
NLD construction involved much manual data entry, ways were found to 
automate nearly all but the intellectual decision-making processes. In 
addition to finding improved ways to construct a lexical dictionary, new 
applications for the NLD have been found and are being developed. 
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INTRODUCTION 


Purpose 

The purpose of the NLD is to minimize the indexing of documents 
already indexed by another agency. Approximately half of the report 
literature added to the NASA STI Facility data bases each year has been 
previously cataloged, abstracted, and indexed by another agency. See 
Figure 1. Much of this previously processed material is received at the 
NASA STI Facility (hereafter referred to as the Facility) in 
machine-readable form on magnetic tape. The Facility's objective is to 
accept as much as possible of this work and the NLD is part of the overall 
effort. The NLD accepts, in machine-readable form, words and phrases from 
the document record created by another agency and translates them into 
valid NASA index terms. The words and phrases that are run through the NLD 
are normally terms from a controlled vocabulary such as DTIC's. However, 
it is possible to take a title or a line or two of text and treat it as if 
it were a long phrase. While the Access Routine was not designed to select 
phrases from text, it can be used to generate posting terms from a limited 
amount of text such as a title, title supplement, note of content, or from 
words and phrases from any machine-readable source. The terms generated in 
this manner must be reviewed and may need to be edited by the indexer. 


Significance 

The NLD Subject Switching system is a flexible tool. It could be 
implemented as a time-saving device by any organization that accessions and 
reindexes documents that have been indexed by another organization. The 
components of the NLD provide the basis on which to build either a system 
for automatically indexing text, or a system for the automatic translation 
of index terms from any controlled vocabulary to another. 

Definition of the NASA Lexical Dictionary 

A lexical dictionary has been defined in several ways. 
Paul H. Klingbiel, who initiated the NLD, defines it two ways in his latest 
work (ref. 1): as "a phrase structure rewrite system" and as "a matrix." 
Roxanne Newton defined the NLD (in the "System Overview" written for 
Facility use) as "a translation device." These different descriptions 
represent different points of view. To a mathematician, a lexical 
dictionary is a matrix; to a linguist, it's a grammar; to an accountant, 
the system may resemblea spreadsheet; but to those dealing with operating 
systems, the lexical dictionary is a translation device. June Silvester 
adds that, to the indexer, the lexical dictionary is a tool. 

This report addresses itself primarily to the operating system 
definition— that is, that the lexical dictionary is a translation device, 
and secondarily to the idea that the lexical dictionary is an indexer tool. 
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Scope of the NLD 


m a c a T . h ®, orlgina l N1 - D project, which aimed to translate DTIC indexing to 
NASA indexing, involved two procedures. First, the translation of the 
conce Pt of every individual term in DTIC 's controlled vocabulary to one or 
more NASA terms that express the same concept. Second, the translation of 
two or more coordinated DTIC terms to the NASA term or terms expressing the 
maca concept. .1/ the concept of coordinated terms required more than one 
NASA term for its translation, the NASA terms must be different from the 
DTIC terms. For example: 


DTIC 


administrative personnel 

aeroelasticity 

aircraft;drones 

army planning;army research 


The DTIC 
review. 


NASA 

management, personnel 
aeroelasticity 
drone aircraft 

project planning, armed forces (United 
States) 

are given to the indexers for 
functions: 


terms and their NASA equivalents 
Indexer review consists of four 


• Accept or reject NASA terms listed by the NLD. 

• any . terms not listed h y the NLD that are necessary for the 
NASA environment. 

• Indicate which terms are major terms. 

• Recommend NLD changes such as: ways of improving translations, 

deleting irrelevant translations, or adding terms that should be 
coordinated to translate to a single term. 


Preliminary Results 


time, 
tool , 


Management expected that the use of the NLD would shorten indexing 

a " d J,n a St Based on a questionnaire that was used as an evaluation 
the NLD saves at least three minutes per document indexed. 


The project personnel expected that the use of the NLD would make 
indexing more of a decision-making process and less of a lookup job — 
lookup the term, lookup the word form, lookup the spelling, etc. The NLD 


The indexers had mixed expectations. Some feared that the NLD would 
eliminate their jobs. The NLD has not done that. Many terms are context 
sensitive and to maintain high quality indexing at the Facility, the 
decision was made to require indexer review. The NLD provides a 

team-approved translation of any input word or phrase. The indexer 
provides a check on context sensitive selections, a choice of pertinent 
NLD-suggested terms, and any additional index terms needed to serve the 
NASA environment. We feel that this combination makes the NLD an expert 
system (see section on SYSTEM DESCRIPTION). P 



Indexers who understand best how the NLD operates and what its output 
means are the most enthusiastic. Indexers must be trained to achieve 
optimal use of the NLD, but the training required is minimal. 


Presentation 

c k' T 5 is c dsts'Ms the resources required for implementing the NLD 

Subject Switching System and provides a step-by-step implementation plan. 
It includes a system overview, the NASA experience with DTIC-to-NASA 
subject Switching, the three-phase implementation plan which we followed 
and some recommendations for doing it more easily. It also describes 
system maintenance--what is involved and how to do it. Finally, we discuss 
the benefits, problems, and future of the NASA Lexical Dictionary. 

Project Personnel 

... . Jhe NLD has had three project directors since its inception: Paul H. 

Klingbiel, Roxanne Newton, and former analyst June P. Silvester' 
Programming has been provided by Elaine Sellman, succeeded by Duchesne 

ar p’ t n - d Pa . tncia Carr °H. They were assisted by Midori Keech and 
Nina Kit. Posting term translations were done by senior retrieval analyst 
Edna Flsek, lexicographer Ron Buchan, abstracting/indexing supervisor 
Jacqueline Streeks , Klingbiel, Newton, and Silvester. The project was Il?o 
supported by the publications and data entry staff. 


The NLD team for this task consisted of the project director 
analysts, lexicographer, and the abstracting/indexing supervisor! 
Translations done by any team member were reviewed by other team members 


SYSTEM DESCRIPTION 


Significance Within the Larger System 

As we have stated, the lexical dictionary is a f ool Althmmh n- -i, 

another^ S^"d£ P 

procedures by relieving Indexers of ^u'rely mSchanlcal ‘ iaL- nde *iK2 

asr .k ,- : 5’5 3. Hi™; 

none exis”d Un be C ?o,T'' 0r “ ,eaSt SOme co ™ u " f «tion where often little or 


Expert System Concept 

rss2„r:. 

s'iSSw" ■■ 

(1) a knowledge base (or knowledge source) of domain fartc 

heuristics associated with the problem; J facts and 

(2) an inference procedure (or control structure) for utilizino th* 

knowledge base in the solution of the problem; 9 the 

(3) a working memory- "global data base"-for keepinq tra-k of th* 
probTem status, the input data for the particular problem and 
the relevant history of what has thus far been done. 

(ref. 2, p. 80). 

,-nr-i,J he i NLt ? hasdata fil es created by NASA vocabulary experts The fiioc 
input source to be translated into authorized NASA fniex te™s ° m 3 " 


The working memory for the NLD traces input material through 


the 
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translation process. The memory makes it possible to print lists of input 
terms with their translations, input terms with partial translations, input 
terms with no translations, and statistics about the logic codes used. 

Thus the NLD meets the basic criteria for an expert system. 

Another way in which the NLD system might be considered expert is its 
use of the best of two approaches: Indexer Simulation and Indexer 
Feedback. 

Indexer Simulation . A lexical dictionary simulates the indexer's 
translation of input terms into target vocabulary terms. The printout 
provided by the lexical dictionary lists the terms from the input 
vocabulary and the terms from the target vocabulary, side by side, ready 
for indexer review and selection. This process uses the computer to do the 
repetitious, uninteresting, time-consuming indexer tasks that are largely 
mechanical, and provides expert, consistent translations for a final review 
by humans. 

Indexer Feedback . The indexer reviews the suggested target terms and 
provides the lexical dictionary personnel with recommendations for improved 
translations. These recommendations are studied, usually approved, and the 
needed changes are made. These changes improve the indexer simulation for 
subsequent runs of the program, but the system uses humans to make 
decisions not yet possible with available software. Humans also upgrade 
the computer system which results in improved indexer simulation for 
subsequent runs of the program. 

As in the old chicken and egg go-round, each produces the other. Both 
are essential, and they work in a kind of endless loop. Together the 
system has the best of both automatic and human input, and it keeps 
building on itself, hence an expert system. 


System Functions 

As stated before, the NASA Lexical Dictionary system is a translation 
device. The NLD translates words and phrases from machine-readable input 
material into corresponding NASA Thesaurus posting terms. The mode of 
operation, either Phrase Matching or Subject Switching, depends upon the 
type of input material being processed. 

• The Phrase Matching mode is a general purpose matching routine 
which attempts to find context sensitive word-by-word matches 
between any input phrases and NASA posting terms or Use 
references. Matches may be complete or partial. In some cases, 
no match will be found. For example: 
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Input Phrase 
Salaries 

Fuel consumption 
Inorganic acids 
Cellulose acetates 
Chance-Vought military 
aircraft 


NASA Posting Term(s) 

No match found 
Fuel consumption 
Acids 

Cellulose, Acetates 
Chance-Vought aircraft, 
Military aircraft 


The Subject Switching mode is a special purpose routine which 
translates the concepts expressed by the posting terms assigned 
to a document by a particular contributing source (such as DTIC) 
into the equivalent concept expressed in NASA posting terms. 
Subject Switching treats each input posting term as a unit, in 
contrast to Phrase Matching where the unit is the word. A unique 
translation table is built for the posting terms of each 
contributing source. An entry is created for every contributed 
posting term, but in some cases, the translation may indicate 
that the term is out of scope or not able to be translated. For 
example: 


DTIC Posting Term(s) 

Regiment level organization 

Complementary metal oxide 
semiconductors 
Internal combustion engine 
noise 

Abrasion, Resistance 
Self treatment 

A more detailed explanation of these 
the section on DATA FILE MAINTENANCE. 


NASA Posting Term(s) 

NIS (Not In Scope for 
NASA) 

CMOS 

Engine noise, Internal 
combustion engines 
Abrasion resistance 
00 (No NASA translation) 

two translation modes is provided in 


The Lexical Dictionary system can be used as the basis for an 
automatic indexing system to process text fields, such as abstracts. 
Automatic indexing, if it were instituted, would require the addition of a 
word recognition file to assign syntax codes and a program or programs to 
break text into logical words and phrases for processing. DTIC uses a 
similar system for automatic indexing of several data bases. 


System Components 

The NLD has three major components: data files which act as 
translation tables, an Access Routine which manipulates the input words and 
phrases, . matches them against the data files, and returns the NASA 
translation to the application program, and applications programs which 
call the Access Routine. 


Figure 2 gives an overview of the NLD system operation, 
section will describe briefly the three NLD components. 


and this 
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Data Files . The Lexical Dictionary system employs two types of Virtual 
Storage Access Method (VSAM) files: 

• A general purpose Phrase Matching file and 

• special purpose Subject Switching files for the controlled 
vocabulary (thesaurus terms) of each contributing source. 

The file organization and record layout for both types of files are the 
same. Each NLD file record consists of the following fields: 

• Key 

Each key is unique and consists of terms that may be encountered 
in the input material. The key can consist of a single element, 
followed by a semicolon and two zeros (;00), or of multiple 
elements separated by semicolons (;). In the Phrase Matching 
file, these elements are the individual words that make up the 
target vocabulary posting terms or Use references. In the 
Subject Switching file, each element is an entire posting term 

from the vocabulary of the contributing source for that file. 
Terms may be single or multiple words. 

• Logic Code 

The Logic Code is a one character code that indicates how the key 

is to be processed. Single element keys are assigned one of the 
following logic codes: 

E - (Equal) The key translates to a single posting term 

that is identical to the key. 

C - (Change) The key translates to a single posting term 
that is different from the key. 

L - (List) The key translates to multiple posting terms 
that should be used in combination. 

I - (Indexer Choice) The translation of the key is context 
dependent. The meaning appropriate for the document at 
hand must be selected and a choice of posting terms is 
offered. 

0 - No translation is available for the key. 

When there are multiple elements in the key, the logic code T 

(Table) is always used. 

t Posting Term 

The posting term field contains the NASA posting term or terms to 
which the key is to be translated. The field may also contain 
the following special symbols, which serve as an aid to indexers: 

@ - NASA posting term is an array or ambiguous term. 


10 


> - NASA posting term is broader than the contributing 

source term in the key. 

+ - NASA posting term has narrower terms that the 

indexer should consider. These are terms 
that the contributing source does not have. 

? - Indexer should choose one or more of the NASA 

posting terms as appropriate. 

00 - No appropriate NASA translation is available. 

NIS - The contributing source term is NOT IN SCOPE for 

NASA. 


In some cases, there are more than two elements in a key. NLD System 
processing requires that intermediate records be created which build to 
these multi-element keys. The first entry will consist of the first two 
elements. Each successive entry will add one more element until the entire 
phrase is complete. Since the intermediate keys do not have translations, 
the posting term fields for these records contain a special— symbol as a 
place holder. For example: 

Logic Code Key Posting Term 


T Body;Centered * 

T Body ;Centered;Cubic ** 

T Body ;Centered;Cubic;Lattices Body centered cubic lattices 


Samples of records from the Phase Matching file and the Subject Switching 
file are shown below: 


Phrase Matching File Sample 


Logic Code 


Key 


Posting Term 


E 

C 

E 

T 

T 


Bleeding;00 
Blends;00 
B1 i gh t ; 00 
Blind; Landi ng 
Block;Band 


Bleeding 

Mixtures 

Blight 

B1 ind landing 
Block band 


(Logic codes I and 0; and symbols >,?,+, 00, and NIS are not normally 
used in the Phrase Matching file.) 


Logic Code 

E 

E 

C 


Subject Switching File Sample 


Key 

Filters;00 

Financial management;00 
Fi ngerna i Is; 00 


Posting Term(s) 
Filters 

Financial management 
Fingers 


■U. 


11 


Logic Code 


0 

E 

L 

I 

T 


Subject Switching File Sample (continued) 


Key 

Fingerprint recognition ;00 
Fins; 00 

Fire alarm systems ;00 
Fire protection; 00 

Floating bodies ;Sea ice 


Posting Term(s) 

00 

Fins+ 

Fires, Warning systems 
Fire prevention?. 
Fireproofing? 

Ice floes 


Lexical Dictionary Access Routine . The NLD Access Routine is a 
general purpose program that accesses the Lexical Dictionary files. Its 
product is a list of index terms from the NASA Thesaurus which was the 
target vocabulary. 


The Access Routine never operates independently; it is always called by an 
application program. The application program passes the Access Routine two 
things : 

• a code that indicates whether the Phrase Matching or Subject 
Switching mode should be employed and 

• a character string that is either a word or phrase for Phrase 
Matching or the set of posting terms assigned to a record by a 
contributing source for Subject Switching. 

As the first processing step, the Access Routine creates an array from the 
input character string. For Phase Matching, each word in the phrase is 
treated as an individual element, and the words are left in the natural 
order of the phrase. For Subject Switching, each posting term (which may 
be single word or multiple word) is treated as an element, and the posting 
terms are sorted in alphabetical order. 


The following examples show a Phrase Matching and Subject Switching 
input array: 

Phrase Matching Subject Switching 

Input phrase: Engine Endurance Input DTIC Posting Terms: Engines, 
Testing Research Laboratories Laboratory Tests, Endurance (General), 

Laboratories 


Phrase Matching Array: 

Engine 

Endurance 

Testing 

Research 

Laboratories 


Subject Switching Array: 
Endurance (General ) 
Engines 
Laboratories 
Laboratory Tests 


Aside from the initial difference in creating the input array, processing 
by the Access Routine is basically the same for the Phrase Matching and 
Subject Switching modes. A general description of this processing may be 
found in Appendix A. 
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Add! ication Interface Programs . The NLD system is designed so^that 
the application program determines "the translation mode to De used and the 
f i 1 es P to C be accessed . The Access Routine performs a standard processing 
routine based on these requirements and returns all matches that it. finds 
to the application program. The application program determines w ~ 
the itches will be used. Because of this design, adding new applications 
or rtw wi of existing applications doe s not genera y 
require changes to the NLD system itself. Normally only the applicatio 
program must be created or modified. 
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HISTORY 


DTIC's Role 

Paul K1 ingbiel , first director of the NLD Project, had been active for 
18 years in linguistic research at DTIC. While there, he had initiated a 

system! dlCtl0nary which became P art of DTIC's machine-aided indexing 

ot he r^ a aen cTe s^ 66 n i n 1 u C ^ n 9 duplication of work done by 

naca i a 9? ncies ; In 1981 > it was decided to move ahead with plans for a 

nTT?^ 10 ^ 19 ^ * designed to switch automatically the subject terms 
selected by DTIC's indexers to NASA terminology. 

rnnii 1 l n / b !! 1, i by ^ n retired from DTIC, agreed to organize the project. 
Copies of the. lexical dictionary software were obtained from DTIC and 
programmer Elaine Sellman began a study of NLD requirements. ’ 

Facll?ti C .iL5 r i 9 H a <f S f fl r er t written in C0 , B0L for a UNIVAC mainframe while the 

So althouoh t h p nTir c fE r ° 9rammin9 ,a ? a 9 e ' PL1 > and an IBM mainframe, 
io, although the DTIC software was available, it served primarily as an 
example and the basis for the new NLD programs. primarily as an 

. . A DTIC's lexical dictionary file also was obtained. This was 

used to determine how DTIC would translate NASA posting terms into DTIC 
posting terms and was helpful in constructing entries thaT translated 
coordinations of DTIC terms into single NASA terms. 

NASA KWOC and Data Entry 

„ „ K 1 i n 9 n ie1 began the NbD with a 1ist of NASA posting terms in a SDecial 
Key Words Out of Context (KWOC) format. A KWOC listing had been used at 
DTIC to review and correct inconsistencies that had entered into the 
nf Language Database. By starting the NLD with a KWOC printout of all 
of NASA s Posting terms and Use references, the problems experienced at 

entries ^ Ke^Fi* " 1 *°}’ the KW0 5 became the basic tool Tor coding NLD 
entries, (See Figure 3 for a sample page of the NASA KWOf ) pninmn l 

lists the unique words in the NASA controlled vocabulary in alphabetical 

order. Column 2 shows all NASA terms and Use references that are in the 

Thesaurus and that contain the word in column 1. Column 3 lists only NASA 

Posting terms These are either the same terms that appear in col 2 2 or 

authorized NASA posting terms that are to be used for those in column 2 ! 

Entries for the Lexical Dictionary were selected from column ? finiu 

“tli™ In^tUTraf tUleTeaed^'Sing 

word' 1n S co(umn e i hatched. w ' lere the "° rd f " ^ * *" d the unique 

For example, in Figure 3, note the term OPERATIONS in 
column. It matches the word OPERATIONS in the first column and* should b* 
posted to the term appearing in column 3, namely OPERATIONS. The first 
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OPWOmI Dfr.SiTY OPTICAL DENSITY 

OPTICAL DEPOI APIZATION OPTICAL DEPOLARIZATION 

OPTICAL EMISSION LIGHT EMISSION 

OPTICAl EMISSION SPECTROSCOPY OPTICAL EMISSION SPECTROSCOPY 

OPTICAL EQUIPMENT OPTICAL EQUIPMENT 


rfsfad™ t I 1e te * rn l 1rnni u ed ^tely following OPERATIONS, i.e. OPERATIONS 
tn SE thf H % a S ° matche ? the word 1n column 1, and this item should be posted 

OPERATIONS 6 RESEARCH^ 1 °" ^ in “lunm 3,^ 

The KWOC listing also was used to determine the proper loqic code In 
the case of OPERATIONS in column 2 which is posted to OPERATIONS in column 
3, it would appear that the two are equal and the logic code should be E 

RESEARCH "consfst^nf neX J after 0PERATI0NS « i.e. OPERATIONS 

f -\ W u words makln 9 two elements in the key to the 
record. For any key with two or more elements or for any sinqle element 

T Ch ? s „ the f I rst of a longer key, the logic code 

the pISpe'r logic code. *“° C he ' Ped the PerSdn COd1 " 9 entries t0 se,ect 

q Entries for the Lexical Dictionary were coded for keyounchina 
Specially printed coding sheets were used (see Figure 4) to keep the 
various parts of the entry in the proper columns Three lines (and 
^ hre ? cards) were required for each one- or two-element key. For 
flnd\drtl!i t1 t na \-h Wnr 5 ’S 3 three add itional cards were coded, punched, 
nMmho^ dd d Tk t0 4 r - the -P deck * A11 cards contained an identifying five diqit 
■ first four digits were assigned consecutively except that the 

999 6 a , P H Peared ° n three card ^efore the number changed WhSn 

:r 9 ; ^ eached » the sequence returned to 0001. Since the oriqinal record 
that had been numbered 0001 had already been loaded onto magnetic tape the 
duplication of numbers was not confusing. The final or fifth digit Sf the 
identifying number was either a 1, 2, or 3. It indicated which of the 
three parts of the record the card contained. All cards with numbers 
ending in 1 contained the logic code. For a one- or two-element term card 

wo 1 er C oe" 2frd t 3 e h f ;i5 St fh e ' e ’" e T- Card 2 .stained the s S efelenf of 
thLf er06S ’ i d 3 he d the P° st1n 9 term for that record. For terms with 
th^fiVctTwr card 3 contained a continuation symbol, card 4 hefd 

L? tw ° dements (separated by a semicolon), card 5 furnished the 

annth eleme . n *» and car d 6 the posting term for a three-element kev or 
key! h ^d C s°rjn n . Uat1 ° n 1f a " y additional words were required for the 

.. + 1 ^ *i c j n be , se , en that for 3 seven-word term -- the lonqest in the NASA 
controlled vocabulary — it was necessary to code and keypunch (n-l)3 cards 

n F^^L%e^. n ^Lr w ^ s rd a%r„^a n na^e a f^ r ?Iir j ?bf 0ta1 ot 18 

compllfated thff S those t used r fow b and 9 confined so^e^ddufon^^fntefHgence? 

te™ S ted tl;l t T 1 or TT ab nn fl'T- ‘ he k NASA . Posting consisted of twf or j£fe 
entry ’would be followed by ar^L ma°king ^hVToU*^ f ° r ^ 

that stopping the manual process to reduce the manual procedures to program 
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fK C ! f I Cat !° nS ii bHef the P ro 9 r 3 m niers, write, test, and debug the programs, 
and automatically generate the NLD entries would take more time and be less 

^L ef ,?H C V Ve “T f1nish . 1n 9 the job manually. Therefore, the manual 
coding and keypunching continued. (For the next effort, candidate entries 
were created automatically.) 

The primary job of coding and keypunching entries was finally 
completed, but since there were a number of errors to be corrected, Sellman 
devised a way of doing this online to speed up the process. 

. . During the time when the entries were being coded and the data entered 
into the file, the records were changed from four fixed-length fields 

V 9 AV 1 n fi ^ °? 1C ^ C - ode \ u irst e . 1ement ’ last element, and posting term, to a 
VSAX file containing three fields: the logic code, the key, and the 

term i*H Th ^ e key . for each record was and is unique. Any record in 
I!* f j! e could an . d can De replaced by overlaying another record having the 
. In this way, logic codes and posting terms can be changed. If 
the error is in the key, it is necessary to delete the record and add it in 
its correct form. 

n 4 nnk'Li the f in 3° f 198 l 2 there were some Personnel changes. On April 1, 
Klingbiel retired from the Facility, but was retained as a consultant to 
the project. June Silvester became assistant and acting project director 

J ° b W3S ta * en ^°! e 1 r in late Mdy for e1ght weeks by Ron Buchan while 
v te n r .; as on extended leave. In the meantime, Edna Fleek completed the 
job of getting the file ready for use. The excellence of her and Buchan's 

confidence nasa “» 
operational^ Mi ^0 oLVth.^ 


Matched 

Posting term 
Use reference 


DTIC Posting Term 

DECODING 

DECOMPRESSION 


NASA Posting Term 
DECODING 

PRESSURE REDUCTION 


The June progress rv,ort on the NLD included the following statements: 

S e nnn X - Ca !i. D1 S^? n ?r y now has about 14 ’ 000 records out of a projected 
’ °° nf ln N A tbe N t ASA ^esaurus. After the NASA terms have been coded, a 
tape of NASA. Terms will be made that can be run against the NASA 

terms 91 Dlct10Pary t0 det ermine misspelled terms as well as missing 

Of the computer identified errors, over 300 have been corrected with 
made 9 usTna "the" TSff • ent £ y k «; yin 9- Nearly 200 corrections have been 

laSor of the Sid ' entry Pr03r£ ™ Wh,Ch c0 " sun,es 1/3 of the 
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A recovery command was developed for the TSC entry system for the 
Lexical Dictionary, enabling the entry of data more than once a day. 

First Operational System 

The June report also stated that: 

The Access Routine was tested and has proven workable leaving only 
questions of format to be considered. This means that we have 
actually achieved subject switching between DTIC and NASA terms. 

On the other hand, Klingbiel's September trip report stated that: 

At this point in time there has been no Subject Switching with either 
NASA or DTIC data, except in the most trivial and incomplete sense, 
because neither file as now constituted contains Subject Switching 
data. Subject Switching cannot occur until the present NLD is 
upgraded with data to be obtained from successful DTIC/NASA, NASA/DTIC 
runs. 

This seeming disagreement with the statement from the progress report 
stemmed from a misunderstanding as to the nature of subject switching. We 
reiterate that subject switching is translating concepts expressed by one 
or more posting terms from the controlled vocabul ary of a contributing 
organization to the same concept expressed in the posting terms from the 
target vocabulary, also controlled. 

The system had achieved the capability of matching input phrases, 
character by character -- the first operational segment of the NASA Lexical 
Dictionary system -- but the translation of concepts was instituted later,. 

In early September 1982, the NLD file was transferred from magnetic 
tape to disk files. Also programmer Sellman left the Facility, turning 
over the NLD development to Duchesne Clark, assisted by Midori Keech. 

Klingbiel visited the Facility September 13-24, ironing out problems 
that had arisen during the summer and laying out in detail the steps to be 
taken before his next visit in December. These tasks were carried out ty 
the NLD team of Buchan, Fleek, Silvester, Streeks, Clark, Keech, and 
programmer Patricia Carroll who joined the project in September. The tasks 
included updating and slightly changing the DTIC Lexical Dictionary, fixing 
a problem that had been discovered with the way in which glosses were 
handled, updating the DTIC thesaurus authority listing, doing many error 
checks and corrections, and finally producing four printouts and a copy 
each of the NLD and DTIC 1 s Lexical Dictionary. 

The first of the four printouts was the result of running DTIC ' s 
posting terms through the NLD which, so far, consisted of just one file and 
a program that could phrase match. This program provided a printout of 
DTIC terms and matching NASA terms, not only when the entire DTIC term 
matched, character for character, but also when only part of the term 
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matched. The listing was sorted by the input posting terms, in this case 
DTIC S. 


The second printout was the same information but sorted by the output 
(NASA's posting terms). 


The third printout was the result of 
through the new version of DTIC's lexical 
sorted by DTIC's posting terms (the output). 


running NASA's posting terms 
dictionary. The printout was 


. Finally, the fourth printout was the same as the third but sorted by 
NASA's posting terms. 


Collectively these printouts totalled over 2,500 pages. When 
K1 ingbiel returned to the Facility on December 6, 1982, it was determined 
that a more compact presentation of the data was required in order to 
expedite analysis and data entry. 


Discussions with Clark resulted in some changes and reprints of the 
four printouts. To avoid cumbersome nomenclature, the printouts were 
referred to as Books 1 through 4, and identified as follows: 


Book 1 
Book 2 
Book 3 
Book 4 


DTIC/NASA sorted alphabetically by DTIC terms 
DTIC/NASA sorted alphabetically by NASA terms 
NASA/DTIC sorted alphabetically by DTIC terms 
NASA/DTIC sorted alphabetically by NASA terms 


Two of these books were re-sorted. The re-sort analysis conducted by 
Clark resulted in another software change and finally five copies of each 
book were printed on 8 1/2" x 11" photocopy paper for use by the NLD team. 

... "Jj® conclusion of Klingbiel's visit on December 10, 198? coincided 
with the announcement of Roxanne Newton's appointment to the position of 
project director. She had joined the project on November 29. 

Implementation of Subject Switching 


„ Second Operational System . The data analysis tasks that were to 

occupy the next few weeks were identified and assigned as follows: 


• Analysis of DTIC terms with no mechanically derivable NASA 
counterpart (Buchan, Streeks). 

• Identification of identities between NASA and DTIC terms (Fleek, 
Newton). 

• Compilation of Tables, i.e. coordinated DTIC terms (Silvester). 


iqq-j Anothei " K1 ingbiel visit to the Facility was scheduled for January 3-7, 
iy83. In the . meantime, the team did some analysis and obtained some 

«S d 4;s2dT N «X C s e te™s. tranS,atfn8 DTIC C °" CeptS *° the saTO concepts 
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Klingbiel recommend ed that as the assigned tasks were being carried 
out, the team: 

1. Note anomalous machine translations for subsequent evaluation. 

2. Evaluate alternative data entry methods. 

3. Collect pertinent statistics which would help in estimating the 
total workload. 



Newton recognized and pointed out that the NLD entries selected from 
the KWOC could be identified even more easily from the NASA Thesaurus. 
This is because logic codes are determined by the initial word position and 
the presence or absence of significant following words. That is, 
significant words in the medial o>* final positions in a posting term or Use 
reference were of interest only to tne extent that they existed or did not 
exist. 

A new data entry method was devised and instituted by Newton, 
Silvester, Clark, and Carroll. At the time of Klingbiel 's December visit, 
the four books of data had been categorized by the type of match that they 
supplied between the DTIC and NASA vocabularies (i.e., no match, exact 
match, change, and coordination -- or tables). Except for the "no match" 
entries, each kind of data was transferred to a dataset that could be 
edited online. Building the datasets in this way kept the files accurate 
since the input had been checked and corrected repeatedly throughout the 
fall months. 

The data in the four printouts, books, or datasets presented a variety 
of problems - most of them anticipated. For instance, "no matches" were 
expected because DTIC ' s and NASA's vocabularies are designed to support two 
different missions. Human analysis of the "no matches" was able to resolve 
about 80% of the cases leaving 20% of DTIC ' s terms with no translation. 
These were zeroed out. As expected, problems in generic level occurred in 
two ways: DTIC had specific terms for which there was no equally specific 
NASA counterpart and vice versa. 

A problem not explicitly recognized prior to the acquisition of the 
four books of data was that which was presented by chemical terms. DTIC 
uses a highly coordinated (Boolean) method of indexing with chemical terms 
that can produce signi Meant false coordinations when more than one 
chemical term is indexed for the same document. No obvious solution was 
apparent. 



It was noted in Klingbiel 's January trip report that about 10% of the 
data had been analyzed, major problem areas and solutions had been 
identified, an efficient data entry technique had been devised, and 
anomalous data had been noted and either deleted or corrected. 

The translations of individual DTIC posting terms to NASA posting 
terms continued as assigned. 
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Meanings of all DTIC terms were examined. Meanings of terms that 
appeared to be identical were compared and translations corrected when 
homonyms were discovered. The evaluation of candidate coordinated entries 
also was begun, as was internal documentation and a preliminary study and 
test of the NLD. As part of the study, the indexers were interviewed 
individually and confidentially. In addition a test NLD was created, 
enabling a comparison of DTIC, NLD, and NASA-indexer indexing for a sample 
of 100 documents. 

By April 1, 1983 all DTIC terms had been examined and a translation 
for each had been entered into the DTIC Subject Switching file. With the 
loading of these entries into the NLD, the DTIC tapes could be run through 
the second operational system. That did provide Subject Switching on a 
limited basis. 


The entries consisted of the following: 


Type of Entry 


Number Coded 


Exact Match 5400 
Partial Match 4500 
No Match 3200 


Third Operational System . By April 28 all of the 6,300 table — or 
coordination — entry candidates had been examined. Over 3,000 entries 
were accepted as presented. Others were accepted with additions or 
alterations. The remainder were deleted. The table entries then were 
loaded into the NLD and full Subject Switching became not only available 
but also operational. 

Review and Feedback . The final phase of developing the DTI C/NASA 
Switching capability of the NLD system began at the end of April 
1983 and is ongoing. This consists of adding and revising entries based on 
feedback from the NASA indexers and on a systematic review of the file by 
the Lexical Dictionary staff. 
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PROCEDURES FOR BUILDING A LEXICAL DICTIONARY 

° f UMCa1 DfCtl0 " ary Im Ple»>entat1on for Subject Switching 
components N of A thWo^ystan: ^ alnar,y developed the following major 

• file structures for Phrase Matching and Subject Switching, 

• entries ,^ r ° Ce< * UreS f ° r Ph ™* ^Mn S and Subject Switching 

• programs for generating candidate Subject Switching entries, 

• the Access Routine program, 

• online file maintenance and validation programs, and 

• application programs suitable for tne Facility's uses of the NLD 
following efforts woul ™be Vequl red? sj,stel " for another organization, the 

and t h e 5 o n ° i n e mTl’nte nance' a°n d valid??’ the AtCeSS Rout ' ne > 
a different host system, 6 d valldat10n Programs to run on 

organization 1 s^ses^f ’ the^NLD system^and SUitdble for tha * 
data | il ° e f s> translatlon entries to create the Lexical Dictionary 

NLD sys tern ^^a n^be 60 * inpl^n^nt e<? in^W VO h cabular ^ t0 another using the 
overview of these four phases ° Ur phases ‘ Fi 9 ure 5 Presents an 

the target vScabul" ary S ( the ^ocabuTa^^^ of a Phrase Matching file for 

translated). This file ‘ consists of 7ntH« ? 11ch input Phases are to be 
reference in the target thesaurus ^ wfi/ ev ® r y posting term and Use 

constructed specifically for the NLD svstpm additional Use references 

be coded manually or a prooram r»n L T he entries for the file can 

Ph^p a n 1Cally D u from a machine-readable file ofTh^th t0 9enerate th em 
Phase One or Phrase Matchina filp th* Min 6 . thesaurus. Using the 

input term or phrase with entries in the^fwf 11 W1 'V attempt to match any 
target vocabulary posting terms. the fl e and tra "slate them into 

basicany P a aS t ra Jsl°a t tJn uVutetwee^th? fi L e . is be 9un. This file 1s 
source (the input vocabulary) W^po%„t*Sa ^Vr»T tPlbUt1 " > 


23 


tfimitfiMirri t 




Figure 5 

Overview of Lexical Dictionary Implementation for Subject Switching 
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vocabulary. Entries in the file pair each input vocabulary posting term 
with the posting term or terms from the target vocabulary that express the 
equivalent concept. Candidate entries for this file are created by 
processing the input vocabulary posting terms through the target vocabulary 
Phrase Matching file created in Phase one. Analysts then evaluate and edit 
these entries to create the final Subject Switching file. A separate file 
is built for each input vocabulary to be translated. 

Phase Three adds entries for coordinations between posting terms of 
the input vocabulary to the Subject Switching file created in Phase Two. 
These coordination entries represent two or more posting terms from the 
input vocabulary which, when used in combination, translate to a posting 
term or terms in the the target vocabulary. One way in which Phase Three 
can be implemented is by creating a Phrase Matching file for the input 
vocabulary, processing the target vocabulary through this file, and 
analyzing and editing the resulting candidate entries. The completion of 
Phase three makes possible full Subject Switching from the input vocabulary 
to the target vocabulary. 

Phase Four is concerned with user feedback and file maintenance. New 
terms added to both the input thesaurus and the target thesaurus require 
additions and modifications to entries in the data files. In addition, 
users can supply feedback as to translations that should be added or 
modified. 

The following sections describe these four phases in detail. 


Phase One: Phrase Matching File 

Purpose . The creation of the Phrase Matching file makes it possible 
to attempt to match terms and phrases from any source (see the subsection 
on Purpose in the INTRODUCTION) with the target vocabulary. Additional Use 
references from varying forms of target vocabulary terms, such as 
singulars, plurals, spelling variants, and gerunds, also are put into the 
Phrase Matching file. The match capability of the system increases with 
the number of Use references in the file. The Phrase Matching capability 
can be used for any application requiring the translation of words or 
phrases into the target vocabulary. The Phrase Matching file is used in 
building the Subject Switching file and is an essential part of a 

machine-aided indexing system. 

Record Description . Each record in the Phrase Matching file consists 
of three fields: the logic code, the key, and the posting term. 

The logic code in the Phrase Matching file is entered in the first 

column of the record. This code is selected according to prescribed rules 
and provides a weak form of syntax for use by the Access Routine in its 
search for multi-element terms. The logic code also indicates the 

relationship between the key and the posting term(s). 

The key consists of one or more elements. In Phase One, these 

elements are the individual words that make up the target vocabulary 
posting terms or Use references. A single element key will end with a 

semicolon and two zeroes. The key for each entry must be unique and must 
be combined with only one posting term field. Input for the Phrase 

Matching file consists of the target vocabulary posting terms, thesaurus 

Use references, synonyms for and variants of the terms, which become 
additional Use references. 

The posting term field contains one or more posting terms from the 
target vocabulary. When an input word or phrase matches a key, it is 

translated to the term or terms in the posting-term field. 

For each entry in the Phrase Matching file, it is necessary to 

determine the key, the posting term(s), and the logic code. 

Key. The key of the record being constructed is unique. It is the 

subject of the record and consists of the words of the term or Use 
reference being described. The Phrase Matching file is based on keys 
created from the target thesaurus posting terms and Use references. 

Additional Use references, such as singulars and plurals, may also be 
added. Each word in the posting term or Use reference is a separate 

element in the key. 

• If the key consists of only one word, a semicolon and two zeroes 
are added following this single element. For example: 

Term: Controllability 

Key: Control labil 1ty;00 


• If the key consists of more than one word, the words (or 
elements) are separated by semicolons. For example: 

Term: Geological surveys 

Key: Geological ;Surveys 

• If the key is identical to the first two or more elements of a 
longer key, then in addition to separating the words by 
semicolons, a semicolon and two zeroes are added following the 
final element. For example: 

Terms: Charge transfer devices 

Charge transfer 

Keys: Charge;Transfer;Devices 

Charge;Transfer;00 

Somo specific formatting rules follow: 


• Hyphenated words or two words separated by a slash are treated as 
a single element. For example: 

Terms: Government/industry relations 

Key: Government/industry Relations 

• An ampersand (&) is treated as a word. For example: 

Term: Atmospheric & Oceanographic Information System 

Key: Atmospheric ;&;0ceanographic;Information;System 

• Parentheses are dropped from around words in the key. For 
example: 

Term: Hudson River (NY-NJ) 

Key: Hudson;River;NY-NJ 


Posting Term. The posting term field represents the target 
vocabulary s equivalent of the elements that appear in the key. The 
posting term or terms are entered exactly as they appear in the target 
vocabulary thesaurus. In the Phrase Matching file, posting terms listed in 
the key field are posted to the same term in the posting term field. The 
Use references in the key field go to one or more valid posting terms in 
the posting term field. Multiple posting terms are separated by commas. A 
space is left between words in a posting term, but not between multiple 
posting terms in the posting field. For example: 


Ke^ 


Posting Term(s) 


Controllability;00 
Chrome ;00 

Geoastrophysics ;00 
Geological ;Surveys 
GoldjPlate 


Controllability 

Chromium 

Astrophysics, Geophysics 
Geological surveys 
Gold coatings 
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Ke^ 

GoldjOO 

Government/ industry Relations 
Hudson;River;NY-NJ 


Posting Term(s) 

Gold 

Government/industry relations 
Hudson River (NY-NJ) 


Logic Code . The logic code indicates the relationship between the key 
an d toe posting term. The first three logic codes are used with single 
word keys. E indicates that the single word key and the posting term are 
F r ,dAL or exact matches. For example: 


E Control labil i ty;00 Controllability 

C indicates that the posting term shows a CHANGE from the single word key. 
For example: 


C Chrome;00 Chromium 

L indicates that the single word key is posted to a LIST or multiple 
posting terms. For example: 

L Geoastrophysics ;00 Astrophysics, Geophysics 

If the key contains two or more words, the logic code is a T. The T refers 
to the TABLE format of the coded file entries. For example: 

T Geological ;Surveys Geological surveys 

T GoldjPlate Gold coatings 

T Hinged;Rotor;Blades Hinges, Rotary wings 

Continuation Entries . When a key exceeds two words, special 
continuation entries must be made for use in NLD system processing. The 
key for the first of these continuation entries is made up of the first two 
words of the term. The next key is created by adding the next word from 
the term to the key. Additional entries are created in this way until the 
entire term appears in the key. 

A symbol is used in the posting term field to indicate that the 
program must continue to look for additional key elements in order to reach 
the proper posting term. The format for the entries required for a term of 
multiple words is a table. For example: 

A term consists of seven words, ABCDEFG, and it is to be posted to a 
term of three words, HIF. The entries are as follows: 


Logic Code 

T 

T 

T 

T 

T 

r 


Key Posting Term(s) 


B 

★ 

B ;C 

** 

B ; C ; D 

% 

B ;C;D;E 

%% 

B;C;D;E;F 

%%% 

B;C;D;E;F;G 

HIF 
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The asterisks and percent signs in the posting term field not only tell the 
Access Routine that additional elements must be located, but also tell the 
analyst how many elements belong in the entry, how many entries the term 
requires, and in the case of omissions, which entries need to be added. A 
program is available that will create the continuation entries, so they do 
not need to be manually coded. 

Spscial Symbols . In addition to the asterisk and percent siqn, 
discussed under Continuation Entries , other special symbols may be used in 

a r e ft"- a given application. For 
example, the NASA Thesaurus designates certain ambiguous or very broad 
terms as Array terms. The Thesaurus recommends use of a more specific term 

f?pl P J a nf t°hl Dh 6 Arr M y A e ™‘ " hen ? ese terms a PP ear in the Posting term 
lilt f i th ? Ph r? se . Matching file, they are followed by the ? symbol. This 
symbol alerts the indexers to the fact that the posting term is an Array 
term. For example: J 


E AnalysisjOO 

E Lifts; 00 


Analysis? 

Lifts? 


Coding for Input , the NLD system has an 
for adding new entries to the file. For online 
as follows: 


online update program used 
update, the entry is coded 


Logic code$Key$Posting term 


Elements in the 
are followed by 
The "$" indicates 


key are separated by semicolons 
" *00". Multiple posting terms 
the end of a field. 


, and single element keys 
are separated by commas. 


Examples of Entries Coded for Online Update: 


E$Control labil ity;00$Control labil ity 
C$Chrome;00$Chromium 

L$Geoastrophys i cs ;00$As trophys i cs .Geophys i cs 
TSGeological ;Surveys$Geological surveys 
T$Hinged;Rotor;Blades$Hinges, Rotary wings 


When the entries are loaded into the Phrase Matching file 
are used as field delimiters are dropped. The fields are 
record as follows: 


the "$"s that 
entered in the 


• the logic code in Column 1 

• the key in Columns 4 through 127, and 

• the posting term in Columns 130 through 400 (variable length) 


h/^!n2 e f r iE ti0n 2? the p n?$? dures for codin 9 and loading new entries may 
be found in the section on DATA FILE MAINTENANCE. . ...... y 


Implementation. Figure 6 presents a graphic view of creating the 
iM?f f n?o C r e “': ary Phr “ e MatCh1 " 9 f11e ' 11 15 a sWe p"r 9 oc«s 
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procuring a copy of the target thesaurus, 
constructmg Phrase Matching entries from 
using the procedures just described, and 
oading the entries into the Phrase Matching file. 


the target thesaurus 


IS a e ^Js r : e ^;? u r n^i2: S a °S^^L P, 5^ e JsVn e g T r |a t C c 0 h de D " ly T ^ 

HoweJe?, °ba s e cf T o n \ h e^e x pe r i e n c e 6 a a 1 n e d ? e ’ Cr1 h Pt - 1 ,T ° f this Spmel?? 
this process can now L a o P t™ lt S A . , froni bul,d1 "9 the original file, 
file 'f the thesaurus is availahi. *. ,ar9e extent ‘ If a "»ch1ne-readable 
all of the entries for the posting tenB^Sd" inf. “ n J ten t0 9enerate 
thesaurus. Analysts would still h* ! nd Use referen ces in the 

references for variant form of thesaurus d te !l con f ruc t additional Use 
available that allows direct online data entr^Treplte^punc^r ” 

validation of the Phrase^atch^n^ATe ha nn been 1 wr ."' tt en that aid in the 
an alphabetical list of terms aoDearlnn .? i ne , . prel 1mi . nar y program compiles 
NLD file. These terms are ref effld ? 8 "tSf , P0St1 W en field of the 
terms. The Lexical Dictionary kevs anw ,^ exica ^ Dictionary posting 

authority files for thesaurus terms n 5 ° 9 t i 1 6 rmS re com P ared with the 
errors. Program ex, s\fo7the f# " P °“ 1b ' e 


Check 

Thesaurus posting terms 
and Use references 

Thesaurus posting terms 


Lexical Dictionary posting 
terms 


Against 

Lexical Dictionary 
keys 

Lexical Dictionary 
posting terms 

Thesaurus posting 
terms 


lems 31 D1ctW * P° st ’'"9 Lexical Dictionary 


keys 


To Locate 
Omissions 

Omissions 

Non-matches 

Non-matches 


software!" 34 ^* d 1 scre P anc( e S are corrected using online maintenance 

Matching file and tiie^capaMlIty^fo^ vocabulary Phrase 

phrases with target vocabulary terms. ase Matchlns ln P ut terms and 

used ifie'f o? I ow7ng r wTn^ be^ requiVed^ 6 manua, method of construction is 

describecfTn* 1 the settio'n Uud ReJordTscH*,™^ S ‘ r “ CtUre 

&nX te X C a e tntrylf\ A ew Pr °r!co m rds hat r ates a ,oad »>• 

records, and deletions o* records. ds ’ changes to existing 


1. 


2 . 
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3 . 


4. 


5. 


6 . 


Load Program - A program that loads the load file created bv the 
Online Maintenance Software Into the Phrase Matching file. y 

Access Routine - A program that accepts input words and phrases 

lcat ^ on P r °9 r am and returns the posting terms into 
which the input phrases translate. 9 t0 

Continuation Entry Generation Program - A program that creates 
more dements.* 00 “ tr, “ that are requ(red ?or'& ttEl? 

Matchln 9 File Validation Programs - A set of error 
Matching f^°^ rams that validate the entries in the Phrase 

rll on a‘ different Computer tem!^ 8 ^ 6 ’ b “‘ to 

7 - T«Vo\A? 

generate the Phrase Matching entries. t0 

appl i cations** n^ddltl on f to® bull dVng 6 candldat^^ubject ^wl 
S"«“eh , ?nJS!!Sd t Si. th “ " aPP ’ 1Cat, ° n PrP9ra ” 

effort, ° f 

sbs-2s?jSS SSF 5 sms. e 
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Phase Two: Subject Switching File for Individual Input Vocabulary Terms 

Purpose. Phase Two provides a limited Subject Switching capability. 
It involves the creation of a translation for every individual posting term 
in the input vocabulary expressed in terms of the target vocabulary. The 
input and output may or may not be the same words, but they must convey the 
same concept. Phase Two is geared to handle simple individual term 
switches such as those shown below, but not the complex coordinations that 

nT?rfM?crt eS c Se u d - in £ hase Three * The following examples are taken from the 
DTIC/NASA Subject Switching file: 


Logic Code 


E 

C 

C 

C 

C 

I 

L 


Key (DTIC Posting Term) 


Radar;00 

Adenine; 00 

Bases chemistry;00 

Carbon carbon composites;00 

Drilling machines; 00 

Estimates ; 00 

Fluorescent dyes;00 


Posting Term 
InAsa translation) 

Radar 

Adenines 

Bases (chemical) 

Carbon-carbon composites 

Boring machines 

Estimates?, Estimating? 

Dyes, FI uorescence 


Record Description , 
consists of the same three 
file: 


Each record in the Subject Switching file 
fields already described for the Phrase Matching 


• Key 

• Posting Term 

• Logic Code 


This record differs from the records in the Phrase Matching file in the 
following ways: 3 

• the logic code is recorded in the second column of the record 
rather than the first, 

• the elements of the Subject Switching key consist of posting 
terms (which may be single or multiple words) rather than 
individual words, and 


• the posting terms that constitute the elements of the kev come 
from the thesaurus of a contributing organization. 

The keys for all entries created in Phase Two consist of a single element 
followed by a semicolon and two zeroes. As stated above, in Phase Two 
these elements are the single and multi-word posting terms that make up the 
input vocabulary. Each key is unique because the contributing 
organization s posting terms are each unique. 

IofL P ° + S h\ n9 term f ?J d re P resen t s the target vocabulary posting term or 
te^inthe^ey" 625 ^ Concept equivalent to the input vocabulary posting 
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r«.,s srwrJra.” as 

s mt «nvr£«?ss, E,rt—- •• *•»-• 

F Code. Phase Two logic codes E C and i * . . 

srtes s’,s rh& “£y=- " 

spelled^eras Vh’^L^"? 1 ”* 65 - that each ° r 9*"f«t1on has Identically 

& sriT- ~ ~ ^ 


0E 

*E 


Europe;00 

Aircraft carriers ;00 


Europe 

Aircraft carriers 


so^%ha%l"fro a m te the th post?l!g tem'iS tlTJc t"* VT 4 shows 

" a > be singular, w^e^^rVe?^ ,n ’ rt te ™ 

* c Adenine; 00 Adenines 

The input term may have a different form of a word. For example: 

* c Bases chemistry;00 Bases (chemical) 

One term may have a hyphen which the other omits. For example: 

K Carbon carbon composites ;00 Carbon-carbon composites 

essentially t^Tsame^thing^ 1 For™!impVe? ^ 1npUt term ’ but U means 


K 


Drilling machines;00 Boring machines 


subject described 6 by the term?" 9 * ln the t6rm but n0t in the conce Pt or 

target bulary d are ^ecessary^o convey P0St ™ 9 te ™ 5 »« 

the Inpot vocabulary For example: ^ Same "' ean,n9 as the te ™ fr ™ 




Femoral arteries;00 Arteries, Femur 


isfthl $ cont /ins^n^onV elemen^whlch^n^hase^wo 0 ^ 65 °" 1 ** T » a ' 
from the input vocabulary, followed by a semicolon and two zeroes! 1 '" 9 ter ” 

Two new codes are used In Phase Two In the Subject Switching file. 
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Logic Code 01 indicates that the proper translation is context dependent 
and therefore indeterminate and must be an indexer choice. An 
indeterminate translation is flagged with a question mark. For example: 

01 Estimates;00 Estimates?, Estimating? 

The input vocabulary has only the term "estimates" to cover both of the 
concept of "estimates" and "estimating" that are found in the target 
vocabulary. The correct translation must be selected by the indexer based 
on the document at hand. 

In another case, the terms appear to be the same but have a slight 
difference in meaning. For example: 

01 Performance tests;00 Performance tests? 

The target vocabulary's thesaurus limits the use of "performance tests" to 
apply only to operating equipment. The organization contributing the input 
vocabulary uses "performance tests" for equipment, systems, or human 
performance. Therefore, the terms may or may not be equivalent depending 
upon the context. The indexer will have to choose. 

Logic Code 00 is the only numeric logic code used. Whenever a translation 
of a term from the input vocabulary is not wanted or when the target 
vocabulary does not have an acceptable translation, the logic code used is 
zero (0) . For example: 

00 Peer groups ;00 0C 

Key . As stated, in Phase Two, the elements of the key are terms from 
the input vocabulary, not the words cf a term as in Phase One. The key 
contains only one posting term, and two zeros are added as a place holder 
for the second element. An entry is created for every individual posting 
term in the input vocabulary. 

Posting Term . In the Subject Switching file, the posting term is 
selected by analysts familiar with both the input and the target 
vocabularies. The posting term field contains one or more posting terms 
from the target vocabulary or the codes 00 or NIS. The contents of the 
posting term field reflect the best translation that can be made of the 
concept expressed by the individual term from the input vocabulary which is 
in the key. Sometimes there will be an exact match between an input 
vocabulary posting term and a target vocabulary posting term. In some 
instances, the translation will reflect only the addition or subtraction of 
an "s" or a hyphen. In other cases, the term may change to a different 
term or to a list of terms. A translation may not be possible or not be 
wanted and the term is "zeroed out." The logic code is entered as zero and 
the posting term as two zeroes. A term considered Not In Scope is posted 
to "NIS". 

Symbol s . In Subject Switching, three new symbols are introduced into 
the posting term field in addition to the Array symbol (@) described under 
"Special Symbols" in Phase One. When one of these symbols is used, it 
immediately follows the term to which it applies. 


• Indexer Choice (?) 




The question mark, discussed under logic code I, is used when the 
proper translation is context dependent and therefore 
indeterminate. The indexer is presented with a choice of terms, 
each flagged with a question mark. 


Broader Term Translation (>) 

When the suggested target term is of a broader generic level than 
the input term, the Lexical Dictionary posting term is followed 
by a greater than" (>) symbol. For example: 

(4C Jugular vein;00 Veins> 


• Additional Target Vocabulary Narrower Terms (+) 

When the suggested target posting term has narrower terms which 
are not covered by the vocabulary of the contributing 
organization, a plus sign (+) immediately follows the tarqet 
posting term. For example: y 

HE Bolts :00 Bolts+ 

(The input vocabulary has no narrower terms to "Bolts", but 
the target vocabulary has narrower terms "Rock bolts" and 
Tie bolts". ) 


imni ^nl tat1 A n < f 1 9 ure 7 presents an overview of Phase Two 
implementation. A machine-readable file of the posting terms of the input 
vocabulary is required. This file is processed through the NLD system 

an^^Ph^P^rh- 009 ^ a - ry T aSe Matchin 9 file constructed in Phase One 
m»!L Ph Matching logic. For each input posting term either an exact 
match, a partial match, or no match is found. By computer proqram base 

e " tr1ss f ° r e«c? Ss and 
n 1 - tC n U * J. he . se flles are printed, reviewed by analysts, then 
It UrMnn i"?’ W tu" edltin 9 1S complete, they are loaded into the Subject 
switching file. The no-match group is printed and researched by analysts 

or are "zeroed out", that is, 
instances, new terms may be 
these terms. A no-match 

?Sr:n?rSub!ea 9 7:ucht e 9 \ a iVe. entHeS ^ ed1ted> the " are ,oaded ' ont ° 


These no 
possible. 
In a few 
translate 


target vocabulary equivalents", if 
translated to a posting term of 00. 
added to the target vocabulary to 
file is then created using the 


Validation . Programs exist for the 
completed Subject Swi-tching file: 


following comparisons in the 


Check 


Against To locate 


Input vocabulary posting terms 
Keys 

Lexical Dictionary posting terms 


Keys 

Input vocabulary 
posting terms 
Target thesaurus 
posting terms 


Omissions 

Non-matches 

Non-matches 


36 


ase Two: Creating 












software! rePanC ^ eS are f °" nCl ' correct1 °" s are made using online-maintenance 

™feSSsV c,!,as “K 11 ! 

desc rS^n^^e: 100 ' 5 - Pha “ T “° de#e, ° pre "‘ -quires four programs 


1 . 

2 . 

3 . 

4 . 


Phrase Matching File - now completed 
Online Maintenance Software 
Load Program 
Accpss Routine 


In addition, Pi Q $e Two development requires: 


5 . 


Subject Switching Build Program - A set of programs which »mr»« 
Phrase "iht'cMnTf fl/ilid freaks ; inPUt P ° Stin9 tenns thr ° uph the 




a file of candidate entries for exact matches, 

! nr ntnL C ^ dldate . e u tn ’ es for part1al etches, and 

a printout of no matches. 


6 . 


7 . 


Software for editing the files of candidate entries - software 
WVtBufisXlpfuK etHM " 9 capat,i,lt1es such «s TSO, SPF, or 

Subject Switching File Validation Programs - Error checkino 
routines which validate that there is a key to match every input 
posting term, that all elements of the key are WuI 

v P aft"t 9 a£r^t"?ng th ^]! ,n tha **» 

S “ 1tChinp ap ™on, « 

two m Slr ter^ & ^ 

s p ;^;: s s„rt e g d raJ:f„g p ^ s&ri„,r a 

! s? , 3nrt ng th2 


8 . 
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Phase Three: Subject Switching File for Coordinated Input Vocabulary Tents 
voca C b^^^' ( on Ph °^ e ™***^'® 0 ""^“* ra *°“ b0un ' a ^ an pP^'h9 teras^fnto^target 

sSuchlng^cS’HVf™ 5 ' C °" P,eti °" ° f PhaSe Th ™ ^ ™ ® 

switcff^rf^-n j&rrSnM. s-sw s “*2 

^ " saw 

5* iMf AM& fr^ttoe 

» •M.ff S1SSS? S 

s&r to the ^^^pptiorofTnpu^TostiSrter^Shf^;. 0 "^? 


\bl Accident investigations; 
Aircraft 


Aircraft accident 
investigation 


thl Voo h 1c en r t o'I in th t !!% !>UbJeC !, Switching file, It Is necessary to determine 

!KJsS£a««K 

Logic Code. The logic code is always 0T. 
anaiJrt^’ Det ? rminin 9 the key is a decision-making process performed bv an 

Sir af;iv? T i £&«r o»„f"b JSi 

i C r 00 tems a in d) th COnVey t the 

=»SS iKSw -v swas 

opti on"'?or"Thase 1 T h ^e Fi9 Thr 8 n P t raSe,ltS a - n overv1ew ° f me implementation 

vocabui'ar^^Phras^^a^chfng 65 /^ 0 ^^^^ vocabufary^t^rough^th^lnpt^ 
translate^lntolwo 0 “ np ' icab ry* ^ a^sel «2d .TSndttJ? 
The program formats' these entries according to the rules f^ 

e° t f ^ f T . h i e e) nP b«o V m°e Ca t b ht Ky?~ ^tb7e7t tlSS* 

Phrase M atAlng r ?ile) VO bec b omes ry the 0 Subject e Sw1tchlng h posting b term'' 1 ‘° th * 
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From Phase 2 







For example: 


Target Vocabulary Posting Term: 
Input Vocabulary Translation: 
(from Phrase Matching file) 
Creates Subject Switching Entry: 
pT Abrasion Resistance 


Abrasion resistance 
Abrasion, Resistance 

Abrasion resistance 


spot indexing. * * feedback from * b ° 

coordination should become apparent to""* t^VnTd^analyst! tW ° vocabuUr1es - 

create ^table "entries ° P When* ’all s0 ” e . comb,riat1 ? n of them, may be used to 
contributing * = ^ « fi 

used for Phase"iwo T f I* I ef^y “be’ uMd “fo/pha’s^Three. correct1 °" procedure s 
Subject Swi tc'hi ng * fi 1 e ^ the ^NLD^Svs tem th *rh C ° Cir ' V n f5 ed DTIC terms to the 

5 £Tixr “MMi 

beei^translated! ^ 


P . o^ired Programs/Tools . Phase Three reou'res 

components already "described in Phases One and Phase Two: 


the following 


1 . 

2 . 

3 . 

4 . 

5 . 

6 . 


Online Maintenance Software 
Load Program 
Access Routine 

Subject Switching File Validation Programs 
Continuation Entry Generation Program 
Application Program 


candidate 

7 - noTc™ wTteXr V0CabU ' ary Phrase MatCh1 " 9 fiU (,f 
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Figure 9 

DTI C/NASA Subject Switching Output 


sic crc/cnc data 

ADI » A 124 122 CATS * S3C4C6 Unclassified report 

C2 Field 08020 , 14050 

C5 DEFENSE HAPPING AGENCY HYDRCGR AEHIC/ TCE03R APHIC CZNTE2 WASHINGTON DC 

C6 Report cc CflA • S Prototype Graphics free Enhanced Landsat Iaagery foe 
Applications to Hydrographic Charting. 

C9 Rept. fer 5-8 Apr 83, 

10 Naylcr, Alcnzo D. :lafcllett€,williaa H. 

12 2 I p 

22 *Iaage processing, ‘Naviqaticn charts 

23 Oceans, Multiband spectral reconnaissance, Aerial photographs. Optical 
iaages. Digital systeas, Graphics, Production, Quick reaction, Analoq 
systeas 

25 LANDSAT satellites, DIES (Digital Inage Processing Systea) 

<7 The Defease Mapping Agency (ERA) is currently developing prototype graphics 

sensed iaagery fer support to hydrographic survey planning and 
»MA s nautical chart aaintenance pregraa. The iaagery for these prototypes 
is Landsat scenes that are enhanced by digital iaage processing techniques, 
cr processed totally in an analog aede for gulch response reguireaents* This 
paper discusses these processing approaches within the fraoevork of the 
prototype efforts, Landsat's anltispectral scanner iaagery in the Makassar 
Strait of Indonesia is cczpater enhanced to highlight hydrographic 
indorsation such as shcals, uncover areas, land-water boundaries, and 
shallow water depth intervals. These ethancenents are graphically presented 
m a variety of scales, fotaats, and color assignaents representing three 
approaches to ccaputec enhanceaents. To Ftoduce quick response graphics, the 
analog approach to enhancesent involves the use of a color additive viewer 
and auitiscale pro jectcr/viewer for analysis of aultispectral/nultitenporai 
Landsat fila. The prototype graphics using this approach were developed to 
suepert DMA’s chart aaintenance Frcgraa, but could be use as a tool for 
survey planning in shallow waters. 
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?ISLE 23 DTIC TEEMS 

AERIAL PHOTOGRAPHS 
ANALOG SYSTEMS 

DIGITAL SYSTEMS 
GR AEEICS 

IMAGE PROCESSING 

EDLTIBAND SPECTRAL RECONNAISSANCE 

NAVIGATION CHAHTS 

OCEANS 

OPTICAL IMAGES 
PRODUCTION 
CHICK BE ACTION 

FIELD 25 DTIC TERMS 

LANDSAT SATELLITES 

DIES DIGITAL IMAGE PROCESSING STS' 


NASA POSTING TERMS 

AERIAL PHOTOGRAPHY 
ANALOG DATA 

SISTEMS ENGINEERING 
DIGITAL SYSTEMS ♦ 

GRAPHIC A2TS 
IMAGE PROCESSING* 

SPECTRAL RECONNAISSANCE 

SOLTIS? ECTRAL PHOTOGRAPHY 
CHARTS 

NAVIGATION AIDS 
OCEANS 
IMAGES 
PSCDCCTI0N3 
REACTION TIMS 

RASA POSTING TERMS 

LANDSAT SATELLITES 


42 


8. Table Entry Build Program - A program which takes the output from 
running the target thesaurus through the input Phrase Matching 
file and creates a file of candidate entries from the partial 
matches. 

9. Software for editing the file of candidate entries. 

Manpower Estimates . The level of effort required for Phase Three will 
depend upon the implementation option selected. It is estimated that the 
automated approach of creating and editing candidate entries will reauire 
approximately 2 manweeks per 1000 automatically generated table entries. 
In addition, programming effort may be required for the programs numbered 7 
and 8 listed above. No time estimates are available for the other options. 


Phase Four: User Feedback and Maintenance 

for h^nciffnf * updated to°the° Lexical ^“ r „l! ' the of procedures 

to the input end target , ocabS r?et bas . ed on updates 
Figure 10 presents an overview of Phase Fo*ur aSes. ‘ hb 1ndexers ' 

adds terms to or ’'changes’ terms^IrMts conTroTled contr1but,n g organization 
be made in the Subject Switchina fiu ed vocabular .y> changes must 

be made for each new Individual tiU he ve 7j? ast ’ one entry must 

translation into the target vocabulary. 9 ® ther W1th lts logic code and a 

addltiLf uVles’ o^Wroved uK? bdl t,KJ ho h u,d A for possible 

for mating changes are tf’tS SM? 

organization ^’automatic 0 receipt of^infnr"^- w1tl1 f he con tr1buting 
Without continuing communication k Q a. 1n ^ ormab-!0n on thesaurus changes, 
necessary Information for upda Us ther^wi u \ or9arn ' 2atf °" s end the 
between new terms which should be added to "’he Subject "^L?fn7lnS 

in ^7s h "mu a s r t , 'be Hh r en f e r r , t w ermS are added ° d dba "9"« 

entries Vo^o^Ho ^Tse^io STEfa ^ ^ 

new target U vocaLlary C terL fl ' Analysts , must e i af t e f ted by the addition of 

tables, improved translation’s, or for new translations S $ *t '* addnf0 " a > 
zeroed out. rew trans lations of terms previously 

Updates should be made on a regular basis. 

pert , ectua 1 ^effort * 1 S n UC SuMect^S e . drs * 1s a " '"-Portent 

documents in hand, indexers are unioupl J £ Switcb ™g* With specific 
translations are aDDroDriatP Tnwlv** ab e t0 verif y whether suggested 
should be added to P th P e Subject Switch i So” fn 0t new coord ^ations which 
should be modified or deleted"! * S tching flle » or coordinations which 

It is anticipated that indexer feedback will suggest- 

: SSFsiHSS^ 

• Deletions of table entries. 

Lexical Dictionary t ea m !* P ^n^oV i enl; a t i on ^ pp t ^ tW6 ^ the indexers and the 
of full Subject Switching is essential for i n i ? ?a 1 1*n g r t ^ e ? eldb 6 p rSc I s s ?" 
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Feedback and Maintenance 









v-i 


F1gure^n a !^ SAt ^ T ^ n ^^^ y ^ 1 neede^^Vformation^^^See 

S»1 tcFTny^'cdpabi hty? 9 it ""is** useful to VodVe P aV Trior list' 'o? 9 a 1 

Switching fnT* A™?v?t? at h° n iH Wh1Ch - could not be matched In the Subject 
sLld Te added t. t flle reV,ew the$e td < ’ etern,ine 1f new «**« 

xsj^VSs •;» 'A*rwxs ars 

maintenance software programs are corrected using the online 

Required Pro qrams/Tools . Phase Four requires; 

*' aSpS i n^og"aL SyStem Wlth Adcess Routine > data and 
2. Online maintenance software. 

3 ‘ pragrLs MatCMn9 ff ' e and Subject SKltch(n S file validation 

y-sms^p ' ■= “ =xs ssrx 


H 


LEXICAL DICTIONARY FEEDBACK 



Figure 11 


DATA FILE MAINTENANCE 


Sources of Change to Data Files 

Four sources of change to the NLD data files have been identified. 

o Changes in the NASA Thesaurus. Because the Phrase Matching file 

contains an entry for every - NASA posting term and thesaurus Use 
reference, this file must be updated every time the NASA 
Thesaurus is changed. New NASA terms may replace old 

translations in the Subject Switching file or otherwise change 

translations already recorded. 

• Changes in the Input (DTIC) Thesaurus . The Subject Switching 
file contains an entry For every posting term in the input 
vocabulary; therefore, every new input term must be translated 
and this translation added to the file. 

t Changes Recommended by Indexer Fee dback. These may be for either 
data file and are entered following approval by the 

abstracting/indexing supervisor and the NLD project director: 

Phrase Matching file Use references. 

There is an ongoing effort to increase the number of Use 
references constructed specifically for the NLD. These 
consist of synonyms, variant spellings, and different word 
forms of NASA posting terms. The match capability of this 
file, designed for general purpose phrase matching, 
increases with the number of Use references in the file. 

DTIC/NASA Subject Switching file. 

Indexers provide recommendations for improved translations 
based on actual documents in hand. Most of these 

suggestions initiate changes in the Subject Switching file. 

• Changes Derived from Lists. Lists of input terms that find 

either no match or only a partial match in the NLD are printed 
out each time that a DTIC tape is run through the NLD Access 
Routine. These lists are called exception listings. See Figure 
12. The first column on the printout shows how many times the 
term or combination of terms was encountered on this tape. The 
second column gives the DTIC accession number of the first 
occurrence. The third column indicates the DTIC field from which 
the DTIC posting term came; Field 23 for descriptors, DTIC's 
controlled vocabulary - Field 25 (unmarked) for DTIC's 

identifiers or open-ended indexing. The final column shows the 
partial matches--combinations of terms that are part of a longer 
coordinated entry — or unmatched DTIC terms. The ones in this 
example did not translate because of an input error. One added 
and the other omitted an "s". New terms would appear here, too, 
if they had not been added to the NLD. 
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C4/26/84 DTIC TERMS MOT POUMO IM NASA LEXICAL DICTIONARY 


I 

C033784 

23 ALTITODE; GUIDED MISSILES 

2 

B0309I5 

23 ALUMINUM {COMPOSITE HATEBIALS 

1 

A 1392 53 

23 BBIDGES ; CIBCUITS 

1 

B080692 

23 CIBCUITS; CONTBOL 

1 

S080706 

23 COMMUNICATIONS NPTWOBKS {GLOBAL COMMUNICATIONS 

3 

B080729 

23 COMPOSITE HATEBIALS {MATRIX HATEBIALS 

1 

A 139438 

23 DATA PBOCESSING ; DATA STORAGE SYSTEMS 

1 

C033792 

23 DETECTION {HIGH ALTITUDE 

1 

A 13926 1 

23 ESTIMATES ; CBBITS 

1 

B080883 

23 FIBEB BEIMPORCEMENT { GLASS PIBEBS 

1 

B080682 

23 PLIGHT; SPACE FLIGHT 

1 

A 139485 

23 H AZAB DS ; SAFETY 

2 

A 139476 

23 HIGH BATE {INTENSITY 

1 

B 08 06 93 

23 LI MITATICNS ; POWER 

1 

B080799 

23 MEASUREMENT; PABTICLES 

1 

A 13927 1 

23 SLOPE 

2 

B0807 19 

23 TEST METHODS ; THERMAL PROPERTIES 

1 

A 1392 16 

23 VAPOR 

l 

A 139227 

A EXCITCNS 

l 

B08062I 

A/A37U-15 TOWING REELS 

1 

BO 80823 

ABCS AIRBORNE BEAM CONTBOL SYSTEM 

1 

A 1 39 1 55 

ACB AIR CUSHION BARGES 


A 139337 

ACES AIRDROP CONTROLLED EXIT SYSTEM 

1 

B080779 

ACOUSTIC HOLOGRAPHY 

1 

A 1394 82 

ACOUSTIC IMAGES 

1 

B0809 16 

ACOUSTOOPTIC CELLS 

1 

C033856 

ACTIVE MASS INJECTION 

3 

B080720 

ADAPTIVE ANTENNAS 

Figure 12 


Exception Listing 


except° l for°an t acronym^ preceding SlcVrT.rVr"'^ 1 "\ S ? te ™ 

Matching'f1l'e y The exce p t imiTl s 1 1 n fn , it1ate new entr1es Into "he' Phrase 
OTIC terms that could lo ^KK^tSS?"* " eW C00rd,nat1 '>" s ° f 


follows 


To^ summarize, the files changed by various sources of input are as 


Input Material 

NASA Thesaurus update 
Input thesaurus update 
Indexer feedback 
Exception listings 


Phrase Matching 

X 

X 

X 


Subject Switching 


X 

X 

X 

X 


described^ S. 1 * eaCh 1npUt s °“ r " 


Record Coding 


1 i&asr s Ja&sti & & ^ 

update, the entry is coded as follows: y# 


* i WCf \\\\$ J T Or 

For online 


Logic code$Key$Posting term 




Elements in the key are separated by semicolons 
are followed I by ";00". Multiple posting terms 
$ separates the fields. 


keys 
commas. The 


and single element 
are separated by 


Here are some examples of entries coded for online updating: 
For Phrase Matching 


E$Analyzing;00$Analyzing(3 
ESControllabil i ty ; OOSControl labil ity 

LSGeoastrophysicsjOOAstrophysics.Geophysics 

T$Geol°gical ;Survey$Geological surveys 
$Hi nged } Rotor ;B1 ades$Hi nges ,Rota ry wings 

For Subject Switching 


E$Acids;00$Acids 

C$Amino plastics;00$Thermosetting Resins> 


For Subject Switching (continued) 

L$Animal diseases ;00$Diseases, Veterinary medicine 
T$Blood Circulation;Brain$Brain circulation 
TSBlood Circulation;00$Bl ood circulation^- 
I$Abiotic processes ;00$Abiogenes is? 

0$Acne;00$NIS 

OSAerial pickup system;00$00 

When the entries are loaded into the file, the "$"s which are used as 
field delimiters are dropped, and the online maintenance software 
automatically places the logic code in the correct column. The fields are 
entered in the records as follows: 

• the logic code in column 1 for the Phrase Matching file and in 
column 2 for the Subject Switching file, 

• the key in columns 4 through 127, and 

• the posting term in columns 130 through 400 (variable length). 


Maintenance Functions 


The functions or capabilities provided by the NLD's online maintenance 
system are executed through series of commands. These allow maintenance 
personnel to process input from any of the maintenance sources described 
above. A separate set of commands is provided for each of the NLD data 
files. The chart below indicates the capabilities or functions provided by 
the maintenance system, along with the command used to carry out each 
function for each of the NLD data files. 


Maintenance Functions 

Maintenance System Commands 


Phrase Matching 

DTIC/NASA Subject 


File 

Switching File 

Creating Authority Files 

VALSETUP 

DTICVSAM 

Data File Validation 

NASAVAL 

DTICVAL 

Entering Update Transactions 

NASAUPDT 

DTICUPDT 

Loading Transaction Files 

NASALOAD 

DTICLOAD 

Printing Maintenance Tool 

NASAPRNT 

DTICPRNT 

Printing Maintenance Tool 

NASANVRT 

DTICNVRT 

Creating Backup Tapes 

NASABKUP 

DTICBKUP 


The commands listed above, in addition to several miscellaneous maintenance 
commands, are explained in more detail in the section on "Maintenance 
Commands . 

Additions of New Records . To add a record to any file, use the 
appropriate update command as indicated in the table of online maintenance 
commands above. The form of the entry is: 


51 


Logic Code$Key$Posting term either with or without a symbol. 


Del etion of Existing Records . To delete a record from am/ fii 0 llco 
appropriate update command as indicated in th/ tahip^f n’r 

tenanrp rnmmanHc r...! 1n tt1e tab ' e Of Online 


maintenance commands above, 
unwanted record. For example: 


Enter DEL dollar sign and the key of 


Existing record: 

to be, 

coo^nauo" of these terms 
Enter: DELSDISTRIBUTION PARAMETERS 

updat ^cSnd° as indlm^T n T he UbTe^f U ? e f the »PP™P**te 

above. Delete the existing recorded' ’add the Sord 

misspel?ed r . eC ° rd: E$ERUOPE >°°S™0PE most be deleted as the key is 

eS record, 

the T & L °o q ;Ja c^co^d Is" St ed 

maintenance commands above. Re-enter the record in its correct fo™. " 
For example: 

should' 1 have C a r ?ogic P code 6 of P T!^* EMULS ^ 0NS ^ f>H ^ T0G ^ AP ^ EE EMULSI0NS 
Enter: T$PHOTOGRAPHIC;EMULSIONSSPHOTOGRAPHIC EMULSIONS 
that^same key.® replace any previousl * entered log-ic code for 

IS 

shou,d have a " ■"* 

Enter: E$MEDICINE ;00$MEDICINE@ 

term/s) for that^ame Xey Wil1 rep1ace an ^ Previously entered posting 



Symbols 


Symbols should be used as needed. These have been described at some 
length in the subsections on Symbol s under Phase One and Phase Two. 


Logging On 

Additions, deletions, or changes to any NLD record are done online. 
One user ID has been designated for NLD file maintenance; a second one is 
available for data entry only. Follow the log on procedure for whatever 
database management system used, and when the system prompts that it is 
ready, type in the desired command. NASAUPDT is used to correct the Phrase 
Matching file or DTICUPDT is used to maintain the DTIC/NASA Subject 
Switching file. The use of either of these commands creates a dataset of 
entries which will be used to update the master NLD file. Errors in this 
dataset are corrected online also. 


Maintenance Commands 

The NLD maintenance system provides a series of commands that are used to 
accomplish file maintenance activities. For each type of online activity, 
there are normally parallel commands for each data file. The corresponding 
command for the NASA Phrase Matching file usually begins with the letters 
"NASA". The command for the DTIC/NASA Subject Switching file begins with 
the letters "DTIC". 


Functions 


Provides NLD translations online 
Creates backup tapes 
Creates continuation entries 
Displays file entries online 
Loads transaction files 
Prints file, alpha by postings 
Prints file, alpha by key 
Counts entries, sorted by 
logic code 

Unloads file for large-scale 
editing 

Enters update transactions 
Validates file entries 
Creates authority files 
Displays records online 

These commands are described more 


Commands 


Phrase Matching 
File 

DTIC/NASA Subject 
Switching Files 

DTICACC 

DTICACC 

NASABKUP 

DTICBKUP 

NASACONT 

DTICCONT 

NASAFIND 

DTI CF I ND 

NASALOAD 

DTICLOAD 

NASANVRT 

DTICNVRT 

NASAPRNT 

DTICPRNT 

NASATOT 

DTICTOT 

NASAUNLD 

DTICUNLD 

NASAUPDT 

DTICUPDT 

NASAVAL 

DTICVAL 

VALSETUP 

DTICVSAM 

PRINT IDS 

PRINT IDS 

fully in the pages 

that follow. 


DTICACC This command processes an input word or phrase through the Access 
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DTICBKUP 

NASABKUP 


DTICCONT 

NASACONT 


DTICFIND 

NASAFIND 


DTICLOAD 

NASALOAD 


Routine. It provides, on the terminal screen, the full or 

^ rt 1 *U ranslat10n of the 1 n P ut material, if any translation 
into NASA terms is available through the NLD. Otherwise the 
program returns the message: 


UNABLE TO IDENTIFY 


The command can be used to see how the NLD will 
or groups of terms that do not appear on a tape. 


translate phrases 


These commands create backup tapes for the VSAM master files: 

1 NLD . SSDTIC .MASTER 1 and 
'NLD. NASA. MASTER' respectively. 

A backup is run after every file update so that the most current 
backup tape always reflects the current status of the VSAM file 
Three successive backup tapes are retained in the tape library 
for each file. When a new backup tape is created, it replaces 
the oldest existing, backup. An entry is recorded in the File 
Backup Log (shown in Figure 13) for each successful run of a 
backup command. The job printouts for the last three backup jobs 
are also kept for reference. H J 

These commands initiate jobs that read every entry in the data 
tile, generate all required continuation entries and add them to 
tne rile, and when a new continuation entry has a key identical 
to an existing posted entry, adds a ;00 to the end of the key of 
the posted entry. DTICCONT and NASACONT are used only when an 

1 S0 - ar9e that codin 9 and entering continuation entries 
individually is too time consuming to be economically feasible. 
The commands are executed after the update and at the end of the 
work day so that the programs can be run overnight. 

in??! 5°™ ands searc 1 h the data f ^es for a specified key, and 
print at the terminal ten sequential Lexical Dictionary records 

kp2T 1 nn* W ) th jj he i ey re 9 uested , ^ it exists. If the y requested 
in y whLh 0 Jh f 0 J nd ’ J he P r °gram will locate the sequential position 
in which the key should occur and print the next ten records. 

IJf!Lw 0ni K andS *u load ,m^ it10ns and corr ections from the dataset 

LEX NA^A Mfin i PDT command ’ that is LEX.DTIC.MOD or 
LEX. NASA. MOD, into the appropriate master file in order to update 

•t* is -NLD NASA MASTER^ ^Tff ? ' N LD • SSDT I C . MASTER * and for NASA 
. N L " • NASA. MASTER . The Load command performs a number of 

edit checks on the transactions. Transactions passing the edit 

^p C rfoiitfH d f tran ^ Ct10n ^ are loaded int0 the master file and 
fIL i et ? d from the , work dataset. Transactions rejected by the 
edit checks are not loaded, but are rewritten to the appropriate 

dataset for correction. Rejected transactions are 
e on the printout with a notation of the error which caused 
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NLD. NASA. MASTER 
BACKUPS 


Figure 13 
FILE BACKUP LOG 


NLD. SSDTIC. MASTER 
BACKUPS 


Pate Time Job# Initials 


Pate Time Job# Initial 



c^re^ 

then re-executes the Load command. — aataset and 

" A ” m 

sorted ''afpha^etVcally'by^p^sVfng^ temf^^Entrle^wIth multiple 
K! i" 9 ^V re I 1sted ““ for *«h posting tem The IS 
resSectfS" 0,6 SOn ^ Pr1nt tde fo,low1 "9 files? 

'NLD.SSDTIC. MASTER' 

'NLD. NASA. MASTER' 

Sample pages of NASANVRT and DTICNVRT are shown In Figures 14 and 

ssfflE »as,"5 rs w-tr-sssr 

'NLD. SSDTIC. MASTER' 

'NLD. NASA. MASTER 1 

Sample pages of NASAPRNT and DTICPRNT are shown in Figures 16 and 


DTICTOT 

NASATOT 


DTICUNLD 

NASAUNLD 


The Total command provides a count of the number of entries in 
the appropriate data file, broken down by loqic ode Frr 
messages are written for entries that do not have a valid fojlc 

Th© Unload command copies the entn’p^ in a \/<am r.* n 

sequential files as are needed to hold all of the VSAM file 
entries. The files are named in this pattern: 


LEX.SEQ1.DTIC 
LEX.SEQ2.DTIC 
LEX.SEQ3.DTIC 
LEX.SEQ4.DTIC, etc. 


or 

or 

or 

or 


LEX. SEQ1. NASA 
LEX. SEQ2. NASA 
LEX. SEQ3. NASA 
LEX. SEQ4. NASA, etc. 


fSmat: tr1eS theSe sequent1a1 ™es are In the following 


Columns 1-3 
Columns 4 - 127 
Columns 130 - 400 


Logic Code 
Key 

Posting term 


as .*; 7 nr.,’;:: 

attempt 6 this ' rel oad^hemeTves . personne ^ - cautioned not £ 
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CC/28/62 


OTIC LEXICAL DICTIONARY BY POSTING TERM 


EE 

C 

T 

EE 

T 

T 


REF I N I NG ; OU 
REFLECT IV]TY;00 
COEFF ICJENTS. REFLECTION 
REFLECTANCE .00 
RADIATION. REF lection 
REFLECTION ; WAVES 


T REF L ECT ION . TE LESCOPES 
C INTERNAL RC F L EC T I ON . 00 
L DIFFUSE REFLECT I ON ; 00 

TE REFLECT I0N;00 
T NEBULAE .REFLECTION 
EE REF LECTCVE TERS ; OC 
L REACTOR RE F LECTORS . 00 

EE REFLECTORS ; CO 
L VASOMOTOR RE F LEXES ; 00 


EE REFLEXES; 00 
EE REFORESTATION. CO 
L ACOUSTIC REFRACTION. 00 

T retract ion . telescopes 

EE REFRACTION 00 
, c REFRACTIVE INDEX 00 
EE REFRACT 0M£ TERS. 00 
E REFRACTORY COATINGS. 00 
L HEAT RESISTANT MA TERI A LS ; 00 

T HIGH TEMPERATURE .MATERIALS 
E refractory materials. 00 

E REFRACTORY METAL ALLCrS 00 
E RFFRACTORy metals.-oo 
L HEAT RESISTANT METALS. 00 

EE REFRIGERANTS. 00 
L REFRIGERANT COMPRESSORS. 00 

L REFRIGERANT condenseps . 00 


T 

c 

I 


MACHINES. RE FRIGE RAT I On SYSTEMS 
REFRIGERATION SYSTEMS 00 
COLD STORAGE ; CO 


L CLOSED CIRCUIT RE FUEL I NG ; 00 
L REFUELING PUMPS. 00 


EC 

C 

E 

T 

C 

E 

E 

T 

L 


REFUE i I NG : 00 

REGENERATION E L EC T ROM C S . 00 
REGENT RATION ENG I N‘E ER I NG ; CO 
CYCLES; REGENERATION ENGINEERING 
SUPE RREGENERA T I ON ; 00 
REGENERATION PHYS I0L0GY ; 00 
REGENERATIVE COOHNG:0O 
FUEL CELLS; REGENERATION ENGINEERING 
GAS TURBINE REGENERATORS ; 00 


Figure 14 


REFINING* 

REFLECTANCE 
R£ I LECTANCE 
REFl tCTANCE 
REFltCTtD WAVES 
RE F L EC T E D '.A VE S? . 

WAVE REFLECTION? 
REFLECTING TELESCOPES 
Rtf lection 
reflect ion . 

DIFFUSE RADIATION 
REf LECTION 
REFLECT JON NEBULAE 
REF L EC T CMC I£PS 

reflectors. 

NUCLEAR REACTORS 
reflectors* 

RE F L txES . 

NERVOUS SYSTEM 
RE F LFXES* 

RCFORESTAT ion 
A wOUST ic ATTENUATION. 

REfRACTEO WAVES 
RE F RAC T ING TELESCOPES 
REFRACT ION 
Rf * PACT I V 1 T y 
REF RAC TOME TERS 
RE FRACToRr COAT INGS 

refractor* materials 
thermal resistance’ 
ref rac t ow f materials 

REFRACTOR' MATERIALS* 

RE I RAC TOR y METAL ALLOYS 
RE F RAC IOP ' MET ALS 
REF RAC TOR f METALS. 

thermal resistance 
refrigerants 
compressors 


RE f R ] GE RAT I NG MACHINERY 
CONDENSERS ( LIQUEFIES). 

refrigerating machinery 
REFRIGERATING machinery 
REF PIGERA7C R$ 

ENERGY STORAGE* 5 . 

REFRIGERATORS? 

REFUELING. 


automatic 
refueling 
fuel pumps 
REFUELING 

regeneration 

regeneration 

regeneration 

REGENERAT ion 

regeneration 

regenerative 

REGENERATIVE 

regenerators, 


CONTROL 


(ENGINEERING) 
( ENGINEERING) 
( ENGINEERING) 
(ENGINEERING) 
(PHYSIOLOGY) 

cooling 

FUEL CELLS 


> 


DTICNVRT Sample Output 


OKlGii' * 1 

OF POOR QUAL..V 
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or r *■ ' v * ■ 

U,/;||/I1J NASA LEXICAL DICTIONARY Ui HOST » No II KM 


T (JOHN I f M . A | HCRAT T 

T .HA RAM I DER. ROCKET ; VEHICLE 

T LOI ‘.AL . !il C 1 IONS 
C DOS! .O ' 

E DOliAoL.OU 
C DO'* I Mf IRY .00 
E DOM ML I I MS . UO 
L CORDITE .00 

T D0U31 L . liASL . PROPEL I ANTS 
T bO'GU L . i:<M ;»:DChE I .PROPELLANTS 
C GSt Ul A 1 iONS .till 
T DWIlil L . (.0 >1 

t noma l.im c imon. arithme r ic 

T DlAJhll Slbl HAND. TRANSMISSION 
T DOOUl A*. . A I RU.AF T 
E Down - ( mm HUMS ;00 
E DOWUl INKING. DU 
T OOwNRAt.Gi .on 
T . PROGRAM 

T DOwNRAU.t . AIM IMISSI LE ; ME ASUREMENT ; PROGRAM 

T DgWNRAi.GL . Ml ASUREMENT 

C DOWN I I MI ,uu 

E Down-wash. od 

T DMA CON t I) . MI If 0RC1DS 

T bit Af I .(.AS. I LOW 

T (Jim 1,10 

T DRAf 1 II. (..DRAWING 

T DRAT I 100. MACHINES 

T OM.Ui.cn IC1 

T DRAG . 00 
T DRAG : ( I ill 1 1 S 
T OMGGUI .PARACHUTES 
T DRAb.COf I I ICitNTS 
L bl’AGUl AIUUS.UO 

T DRAG .1:1 VIU S 
T DMAi» ; I 0F« L . ANEMOMETERS 
T DR AG. Ml AMIRI Ml NT 
T DRAG ; M DUL I l ON 
C DRAINING Ou 
C Rl II 10 1 I '• .OO 
T DRA I NAGF wii 
T OHlOh I I 1C . DRAINAGE 
T URAINAC.L . PA IILRNS 
T I Nl Lki A(.l No . DRAINAGE 
T RADI Al . DRA INAGE . PAT TERNS 
T Rif T AN .Ul AM ;DN4INACE 
E DRAW I N-.. .(HI 
E DRAWINGS. Oil 
T ULVA1 IONS .DRAWINGS 
E DHL AMS. DO 
T bRL DgCD . MA 1 1 RIALS 
E DRI DOING. DO 
T DM 1 1 T . I NS I RlH.f NT AT I ON 
T INSTRUMENT .DRIFT 
T DRU T .RAIL 
T DR if I .00 


DORN IIR AIRCRAFT 

OORNUP P.UAGLIDEM ROCKET VEHICLE 

DiiRoAl SI Cl IONS 

DiTOAi.C 

DoS At i 

DOS 1 1- 1 I LIU. 

DOME*! II MS 

COL lu|l*AI I ROPE HANTS . 

DOOUl ( RASE P ROPE 1 1 AN ! S 
DON Rl I | : A Si PROPELLANTS 
DOOM I E I .A SI ROCKET PRONE LI AN I S 
Doom E costs 
DotmiE cusis 

DwUlilE PR t < 1 S I ON ARITHMETIC 

DolllUt S 1 bl SAND transmission 

bout. I AS All CRAr T 

DOWN CONVERTERS 

Down i I Nr. it.o 

DOWORANt.L 

DoWORANol ANTIMISSILE ME ASURI WENT PROGRAM 
DOWNRANOl ANTIMISSILE ME ASUMl ME NT PROGRAM 
D( tWNRANOl I* E ASUREMENT 
DOWN I I ME 
bOWl. v. A SH 

URAL ON ID Ml TEOROIDS 
DRAT I (GAS FLOW) 

DR A f I- 

DRAM INC. (DRAWING) 

01 Af I I No MACHINES 
DRAG 

0 R A C. 

Drag ClioT I s 
DRAg Cl Hi 1 1 s 
ORAG CM If ic I ENTS- 
DR At. bl VI U S . 

IIRAKL S O OR ARRESTING MOTION! 

DRAG Ul Vl( I S 

DRAG IORCL ANEMOMETERS 

UR* Alt MI A SOI EMENT 

DR/.g Rl DUC I ION 

UlfA INAuf 

DPAtOAut 

Dr A II.AGl. 

ORAINAGl PATTERNS 
DRA If. At. f PATTERNS 
DRA 1 DA 01 PATTERNS 
dra inage f at terns 

DRAINAGE PATTERNS 
DRAW 1 fit . 

draw i ho*; 

DR Aw J Nbb 
DRI AMs 

OKI DOf U MATERIALS 

1 JR I UU N(. 

OR I I I I IMSIRUMCNTAT TON) 

DR I I I I IN .IRUMINTATION) 

DRIM RATE 
ORi f o 


Fiqure 15 NASANVRT Sample Output 
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06/14/UJ 


OTIC LEXICAL D I C T ] ON A ft V 


OF PG u\‘ G. 


T FOI L S M& 1 f HI Al S . ME TALS 
E FOILS MATf RJAIS .00 
C f OKKf R I'L ANC K EQUATIONS 00 
C FOI OLD UM ILAL lENSE $;00 
C FOLDING f INS ROCKETS . 00 
l FOLDING Ml. k I COP TER ROTORS; 0 O 

L FOLDING WINGS ; 00 

T FOLDING. S 1 RUCTURLS 
TE FOLDING . 00 
E FOLDS GEOLOGY . 00 
EE FOLIAGE. OU 
E FOLIC ACID. GO 

c food Chains. no 

C FO‘JD CGNLUMP II ON ; 00 
C FOOD DEPRIVATION ; 00 
L FOOD Df IERIORAT ION.OO 

O FOOD DISPENSING . 00 
0 r Q 00 HANDLERS. Ou 
L FOOD POISONING ; 00 

L FOOD PRESLHVAT I ON ; 00 

E FOOD PROCESSINGiOO 
0 FOOD SERVICE PERSONNEL ; 00 

0 FOOD SERVICE .00 

T FOOD . S YN ThE I I C MATERIALS 
TE FOOD ; 00 

C FOOT AND MOUTH DISEASE VlRU $;00 

1 FOOT Wt Aft ; DO 


C 

T 

T 

T 

C 

0 

EE 

C 

C 

E 

0 

0 

0 

E 

C 

T 

T 

EE 

C 

T 

T 

EE 

0 

T 


FORAMINl f f RA .00 
FORCE ML MIAN ICS . FREE 
FORCE MECHANICS . FREE 
FORCE Ml CHAN l C s ; INERT 
FORCE Ml CHAN ICS; 00 
FORD 1 l*G ; OU 


FIELD 

FIELD; MAGNE TIC FIELDS 
IA 


FORI CAS I IMG . 00 

FORfc IGN AID ; HU 

FORL IGN l AIK. DACES . 00 

FORE IGN I'DL ICY . oO 

For e I c>n siRviCE orriCERS.oo 

foreign f EC mnOLOGY .00 

Furl IGN. DU 

FOREST FIRES ; 00 

FORES TRY. 00 

FORESTS. MANAGEMENT 

FORESTS. RAIN 

FORESTS. 00 

rORGE PRESSES ;00 

FORGING. ME I ALS 

FORGING. SPINNING MOTION 

FORGING ; 00 

forklift VEHICLES . 00 
formaldehyde ; phenols 


ml i a l foils 

foils (MATf RIALS)* 

f HKKl R PLANCK equation 

UNCI S^ 

(uim|ic HU /uhcraft rocket vehicle 

F 01 D . NG . 

ROIARy wings 
F Gl DING S T I'UCT URES 

wings 

FuiDINu STRUCTURES 
FOLDING 

FOLD*, I GEOLOGY ) 

FOL I AGL 
FOLIC ACID 
FOOD CHAIN 
FOOD IN 1 AKL 
Food Intakl 
U 1 ERlOf<A 1 ION 
FOOD- 
GO 
00 

fuou iniake 
POISONING* 

FOOD PFfGC CSSING. 

PRf M R V I f *G 
FOOD PROCESSING* 

00 

Cu 

SYNTHETIC FOOD 

fool 

V I R USE S > 

Doors IFOOlWEAR)?, 

SHOES? . 

Socks •• 

PRGKi&GA* 

FORCt FREE MAGNETIC FIELDS 
INLRIIA 

TOADS (FORCES) 

Do 

forecasting* 

F GRf I CiN POL I C Y> 

L ANOUAUES> 

fore ;gn policy 

GO 

00 

N 1 S 

FOREST FIRES 
FOREST MANAGEMENT 
FOREST MANAGEMENT 
RAIN FORE SIS 

forests 

F’RL SSE s> 

Forging 

mual spinning 
forc.ing 

DO 

Phenol formaldehyde 


Figure 16 DTICPRNT Sample Output 


OG/M/IU 


NASA LEXICAL DICTIONARY 


T SOLAR; THERMAL 

T SOI AR. Hit RMAL. ELtCTR I C 

T SOI All . Hit RATAL .ELECTRIC; POWER 

T SOI AR . THI RM\L; ELECTRIC; POWER; PLANTS 

T SOLAR. THU<h1AL;f*ROPLJLSION 

T SOLAR. tOIAL 

T SOLAR. TOIAL .ENERGY 

T SOI AR; IOTAL; ENERGY ;SySTEMS 

T SOL AK ; VL LOCI TV 

T SOLAR. WIND 

T SOL AK. WIND, VELOCITY 

T SOLAR . Wi NO . OU 

T SOLAH.X RAYS 

T SOLDERfcD. JOINTS 

E SOLULR 1NG . Oil 

E SOLDERS. 00 

T SOL. t NO 10 . VAl VE S 

E SOLENOIDS: 00 

E SOLE If A.*. UO 

T SOL ID. ARGON 

T LJL ID . ( RYOgEN 

T SOLID. (RtoUN. COOL ING 

T SOL ID. CRYOGl NS 

T SOLID. mUROOES 

T SOL lO.lLLCH OLYUS 

T SOLID. LUBRICANTS 

T SOHO. Nl II.OOLH 

T SOLID. PHASES 

T SOLID. PRuPtl LAHT 

T SOLID. PRuPt LEANT COMBUST ION 

T SOL ID. PRUPt l LANT . IGNI 1 ION 

T SOL ID. PROI I l LANT .HOCK! T 

T SOI ID PROPF l LANT . RGChE T ; ENG1 NLS 

T SOI ID. P.<OcEL LANT s 

T SOLID. fOLKl I 

T SOL ID.ROCKI T .BINDERS 

T SOL ID. ROCKl T PROPEL l ANTS 

T SOLID. R01 AT 1(*N 

I SOLID. S' *1 1) T IONS 

T SOLID. MAM 

T bOl ID. SIAIE. DEVICES 

T S Ol Ili.Sl ATE . iASrkb 

T SOL ID . SI ATE ; PHYSICS 

T SOL ID. St AIL .DO 

T SOL ID;SURf A* IS 

T SOL ID. SUSPENSIONS 

T SOL ID. WAS US 

1 SOLID ROCKET .00 

T SOL ID SOL ID. INTERFACES 

E SOL I DU 1C AT ION; 00 

T SOL IDIFIED. GASES 

T SUM DS. FLOW 

T SOLIDS. OO 

E SOLIDUS. OU 

t S0LIDNS.00 

T SOL 1 T ARY ; WAVE S 

E SOL I tMANi S ;00 

c sol I Tons # oo 


SOLAR IhfRMAL ELECTRIC POWER PLANTS 
SOLAR Till Rf.'AL PROPULSION 


SOLAR TOTAL ENERGY SYSTEMS 
SOLAR Vt LOCI TY 

SOI AR WIND VELOC I T Y 

Sol AR WIND 

SCI AR X-RAYS 

SOL DL RED JOINTS 

SOL Dl R I nG 

SOL Dt Rb 

SOLENOID VALVES 
SOU NO IDS 

solettac 

SO L I Dl PIED GASES 

SOLID CkYOuLN COOLING 
SOLID CRYUifNS 
SOI ID lleurodes 

SOL ID FUUROLYTES 

solid lul;ricants 

SOI ID Nl f ft* GEN 
SOLID PM A Si S 

SOL I d phoplllant combustion 

SOL ID PRDPl LLANT 1 GN 1 T ION 

SOLID PRc, PI L LANT ROCKET ENGINES 
SOI ID I Ruff LLANTS 

SOL I D RuCK l T BINUIKS 
SOLID ROCKl T PROPELLANTS 
R'0 1 A 1 t N(* Li' >D1ES 
SOL ID SOLDI IONS 

SC. I ID STATE DEVICC S 
SOL ID ST AIL LASERS 
SOL ID S I A U PhY S !CS»* 

SOL ID STATE 
SOL ID sum ACES 
SOLID MJ SPINS IONS 
SOL It) WAS IIS 

SOLID IROPUlANT ROCKET ENGINES 

SOI ID- Sul ID INTERF ACES 

Sul I Dl f I C A I ION 

SUL IDI Ml D GASES 

SOL I Db F LOW 

Sul IDS 

SOL 1DU j 

SOI IONS 

SOL I TAkr WAVES 
soli Thane s 
solitary waves 


Figure 17 UASAPKNT Sample Output 
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NASAUPDT IncludlngTddntonV'or 6 deletions *are ^nt'^d f Chan9es 
checks take C p!ace 15 Transactions* 1 M«*™ 1n » a h ’ a series of edit 

transactions^are^oad&Mnto^temporVry 9 work dauset*! 6 ** 5 ( "° d 

Rejected transactions generate error messages online. 

XlliT meSSa9eS th “ Sre retu ™' ! interactively by the system 
INVALID CHARACTERS 

valid set^ aC A-Z^ 0 C -°9 n , ta + l , nS ? ch ® rac ^ er f other than the following 
INVALID CHARACTER IN LOGIC 

sign/:*^* 5 ^^ 6 tnde C posTt *on S ( or* before a the*f & irst ^dollar 

DEL> C, E, I, L, T, 0 (zero), or blank. 

LOGIC CODE TOO LONG 

th^transact Ion. CharaC * erS ° r b ' a " ks a ^ r b * f °™ the first $ In 
LOGIC code all blanks 

Three blanks appear before the first $ in the transaction. 

NO POSTING TERM 

Nothing appears following the second S in the transaction. 

TOO MANY $' s 

More than two $ ' s appear in the transaction. 

INVALID FORMAT 

The transaction does not conform to one of the formats 
^ 9 - C ement;Element$Post1ng ?erm 

nFi 9 «ie COde I E en,ent iOOSPosting term 
DELS (Key of record to be deleted) 

COMMAND NOT FOUND 

make sure that "you 3 are 9 logged o^ under 9 th^ c ° ni J and » check t0 
that you have SDelled un der the maintenance ID and 

persists or if P other ^« n . C ? rrectly - If the P™b1em 

application programmer assigned to & * he 

be"fore° f th*e ' % “ X a«e*pt Ve^y*?' 
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When the session is ended by entering /*, the system performs a 
second series of edit checks on the transactions held in the 

temporary file, and loads the transactions into the LEX. .MOD 

file appropriate to the command. These files are respectively, 
LEX. DTIC. MOD and LEX, NASA. MOD. Rejected entries are listed on 
the printout under the heading TRANSACTIONS IN ERROR and must be 
researched, reformatted if necessary, and entered correctly using 
the online edit capability of the data base management system, 

DTICVAL These commands initiate comparisons between data files and 

NASAVAL authority files. DTICVAL compares the DTIC/NASA Subject 

Switching file entries with the NASA and DTIC thesauri authority 
files. DTICVAL checks: 

• Every DTIC term appearing in the key field against the 
DTIC Thesaurus authority file. If a term in the NLD 
key does not appear in the authority file, an error 
message is generated. 

• Every NASA term appearing in the posting term field 
against the NASA Thesaurus authority file. If an NLD 
posting term does not appear in the authority file, an 
error message is generated. 

• Every posting term in the DTIC Thesaurus authority file 
against the NLD keys. If there is no key in the NLD 
for the Thesaurus posting term, an error message is 
generated. 

These error messages highlight the additions, modifications, and 
deletions required in the DTIC/NASA Subject Switching file. 

NASAVAL initiates a set of comparisons between the entries in the 
NASA Phrase Matching file and the NASA Thesaurus authority files. 
NASAVAL checks: 

• Every term appearing in the posting term field against 
the NASA Thesaurus authority file. If an NLD posting 
term does not appear in the Thesaurus authority file, 
an error message is generated. 

• Every posting term and Use reference appearing in the 
NASA Thesaurus authority file against the NLD file 
keys. Each of these terms should appear as a key in 
the NLD file, and an error message is generated if it 
does not. 

• Every posting term in the <A$A Thesaurus authority file 
against the NLD file posting terms. If a Thesaurus 
posting term does not also appear as an NLD posting 
term, an error message is generated. 

These error messages highlight the additions, modifications, and 
deletions required in the Phrase Matching file. 


DTICVSAM This command is used to create a DTIC Thesaurus authority file 
from LEX. POSTTERM. DTIC, a list of posting terms from DTIC. 
LEX. POSTTERM. DTIC is a sequential file created from DTIC's 
Thesaurus tape and so far updated manually by HLD maintenance 
personnel using online editing capabilities. Each time 
LEX. POSTTERM. DTIC is updated, a new VSAM authority file must be 
created with the DTICVSAM command. 

VALSETUP This command creates two authority files for NASA Thesaurus terms 
from the online Thesaurus files: 

0 A sequential file of NASA posting terms and Use 

references. This file is used by the validation 
routine to check that there is an entry in the Phrase 
Matching file for every NASA Posting term and Use 
reference. 

• A VSAM file of NASA posting terms only: 

'NLD. THES. TERMS' 

The VSAM file is used by NASAVAI. to verify that all posting terms 
appearing in the posting term field of existing entries in the 
Phrase Matching file are valid NASA posting terms, and by DTICVAL 
for the same purpose in the Subject Switching file. NASAUPDT, 
NASALOAD, DTICUPDT and DTICLOAD use the NASA VSAM authority file 
for validating new transactions being added to the data files. 

As the VSAM file is being created, each term is checked against 
the Phrase Matching file ( ' NLD . NASA. MASTER 1 ) to determine if it 
should be marked as an array term and to add the 0 to the term if 
required. To look at a term in the NASA VSAM authority file 
'NLD. THES. TERMS ' , use the PRINT IDS command. 

PRINT IDS This command allows an online look at any VSAM file record and at 
a user-specified number nf additional sequential records. 
DTICFIND and NASAFIND are shortcuts for displaying records in 
master files 'NLD. SSDTIC. MASTER ' and 'NLD. NASA. MASTER' , 
respectively. However, to see a record in other VSAM files, for 
example the NASA file 'NLD. THES. TERMS ' , one must use PRINT IDS. 
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Printout Review 

Following execution of a command, any printout that has been generated 
is examined by NLD personnel. This is to see: 

• Whether or not the job has run satisfactorily. 

• Whether or not there are any errors that must be corrected. 

The Facility NLD Maintenance Manual lists step-by-step instructions 
for recognizing and correcting errors from each listing. 

In general, errors listed in printouts generated by any of the 
commands are listed under a heading that implies what the problem is and 
how to fix it. 

A sample of error messages listed on printouts generated by NLD 
maintenance commands follows: 

KEY (unmatched element) OF (key of rejected transaction) IS NOT FOUND 

The transaction has been rejected because the specified element of the 
key does not match any entry in the input posting term authority file. 
The non-match may be the result of: 

1. misspelling in the transaction, 

2. failure to separate multiple elements of the key with 
semicolons, or 

3. an error in the input posting term authority file. 

If the error is of types 1 or 2, correct the error in the appropriate 
LEX. .MOD. file before re-executing the load. If the error is of 
typeTT use the online edit capability of the data base management 

system to correct the error in the corresponding LEX. POSTTERM. 

file, and execute the VSAM command to recreate a corrected input 

posting term authority file. Then the Load command may be 
re-executed. 

POSTING TERM (unmatched posting term) OF (entire posting term field of 
rejected transaction) IS NOT FOUND 

The transaction has been rejected because the specified element of the 
transaction's posting term does not match any entry in the NASA 
posting term authority file. The non-match may be the result of: 

1. misspelling in the transaction, 

2. leaving out a required 

3. including an incorrect 

4. failure to separate multiple posting terms with commas, or 
5\ an error in the NASA posting term authority file. 
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appropriate LEX. .MOD P f lie* before^r/p C °7'. eCt the error in the 
error is in theluthority file ftvnp Vf eCU V-i 9 t !l e load * If the 
The error must be corrected in th^ecrnu* the lexicographer. 

NASA posting term authority d file creatP^h 0nllne T . he saurus and a new 
the Load command can be re-executed * * by execut1 " 9 VALSETUP before 

t^ransaction)' eJeCted *""■“«"> or NO LOGIC CODE 

or*" wal^in^rea? ^Correct^L^oaic l09 - iC COde WaS missi ‘ n 9 
LEX. .MOD file and re-execute the Load command. ^ th6 appropriate 

I n^^Sub ject^Sw^ ^ k ^ y Rejected transaction) 

alphabetical order. Correct ’the key In^thl the - key must be in 
file before re-executing the Load command. 6 appropnate LEX._.M0D 

capabilities o^thi^data base man corrected usin 9 the online edit 
corrected, it is loaded in ba ' e t ^ ana 9^ en t . s y $ tem. When the file s 
DTICLOAD, or NASALOAD appropnate data file by entering 

This contains error°^^ Dictl^ UNMATCHED KEYS - 

=;»-* x* ;xvs, ;e,is 


KEY NOT FOUND 
OF RECORD 


CLARK 

T CLARK ; DUKE SUPER PROGRAMMER 


a correcVed^entry the record and add 

» %e t0 v h % D £FT s ^ 

LEX. POSTTERM. OTIC file usina thp nfif ’ ,t 1x tbe err °r in the 

base management system and then execute DTICVSAM^ 11 iti6S ° f the data 
Thesaurus authority file. execute UTICVSAM to create a new DTIC 

The page headed UNMATCHED POSTING terms • 

each Lexical Dictionary entrv rnnfl,- • contains an error message for 
matched in the NASA Thesaurus authonty^np^ 51 !^ 9 *1™ which 1?s not 
errors in the following format- ty f The Job prints out the 


Logic Key 
Code 


Posting Term 


For example: 

T Clark;Duke Super Programmer 


Error Message 


Key not found-Super Programmer 
Next record is Supercavitating 
Mow 3 
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In this example, the program could not match the posting term in the 
authority file at all. (This is a test record added to the file by 
said programmer.) As an aid to the analyst, the programmer prints out 
the record appearing in the authority file following the place where 
the unmatched posting term should have appeared. When the error is a 
typographical error in the Lexical Dictionary key, the "next record" 
is frequently the correct spelling of the NASA Posting term. 

C Power Equipment ;00 Electric Equipment® Key not found- 

Electric Equipment® 
Found Electric 
Equipment without ® 

In this example, the program could not exactly match the Lexical 
Dictionary posting term Electric Equipment® in the NASA Thesaurus 
authority file, but did find Electric Equipment (the same term without 
the "®"). Check the term in question in the NASA Thesaurus. 

If the term is not an array term, correct the entry in the Lexical 
Dictionary by deleting the ® from the posting term in the entry using 

the UPDT procedure. See Changes to an Existing Record subsection 

of "Maintenance Functions." 

If the term is an array term, the error (a missing "@") is in the NASA 
Thesaurus authority file. The VALSETUP program which creates the NASA 
Thesaurus authority file adds the "®" to terms if an "®" appears 
following that term in the NASA Phrase Matching file. Therefore, if 
this type of error is located in the NASA Thesaurus authority file, it 
means that there is an error in the NASA Phrase Matching file for that 
term. The Phrase Matching entry is located in the 'NLD. NASA. MASTER 1 
file using NASAFIND. 

Using the NASAUPDT procedure, the @ is added to the posting term, and 
VALSETUP is executed to create a correct NASA Thesaurus authority 
file. 

T FI ight;Stresses Flight Stress Key not found-Flight stress 

Found Flight Stress 
with @ 

In this example, the program could not exactly match the NLD key 
Flight Stress in the NASA Thesaurus authority file, but did find 
Flight Stress®. Check the term in question in the NASA Thesaurus. 

If the term is not an Array term, the error is in the NASA Thesaurus 
authority file. See the explanation in the preceding example for how 
to correct this type of authority file error. 

If the term is an Array term, the error is in the DTIC/NASA Subject 
Switching file and can be corrected by adding the "@" to the Dostina 
term in the NLD entry. 
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A page headed UNMATCHED DTIC TERMS lists an error message for every 
term which appears in the DTIC Thesaurus authority file for which no 
entry appears in the DTIC/NASA Subject Switching file. The errors 
appear in the following format: 

TERM NOT FOUND = (unmatched term) 

Check the unmatched term in the most recent DTIC Thesaurus supplement. 

If the term is a valid DTIC term: determine the correct NASA 

translation, create a new Subject Switching file record for the term, 
and add the record to the Lexical Dictionary file using DTICUPDT. 

If the unmatched term is not a valid DTIC term, the error is in the 
authority file. Correct or delete the incorrect entry in the manually 
maintained DTIC Thesaurus file LEX. POSTTERM. DTIC. Execute the 
DTICVSAM command to create a corrected DTIC Thesaurus authority file. 

Other error messages may report on the status of: 

NASA THESAURUS VALIDATION 
THESAURUS TERMS-NO NLD KEY 
THESAURUS TERMS - NO NLD POSTING TERM 
READING FOR ATSIGN AND KEY NOT FOUND (term) 

If there are problems that the NLD maintenance personnel cannot 
correct, the application programmer assigned to the NLD must be consulted. 
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RESULTS AND CONCLUSIONS 


Applications 

A lexical dictionary can be used for any application that requires 
translation of input phrases to phrases in a target vocabulary. As of 
January 1, 1984, NASA was using the NLD for three applications: 

• building Subject Switching capabilities 

• processing DTIC (TAB) tapes, and 

• processing DOE (EDB) tapes. 

Each of these applications uses the NLD system in a different way. 
Building Subject Switching capabilities uses the Phrase Matching mode and 
accepts all matches, complete or partial, from the NLD system. DTIC TAB 
tape processing uses both the Phrase Matching and Subject Switching modes, 
and accepts only complete matches from the NLD System. DOE EDB tape 
processing used only the Phrase Matching mode (until the DOE/NASA Subject 
Switching file became operational) and accepted only complete matches from 
the NLD system. 

As this is being written, two other applications are in the programming 
stage: 

• Using the Phrase Matching file to process Library of Congress 
MARC records and accepting complete or partial translations. 

• Using the Phrase Matching file to process natural language 
phrases automatically extracted from abstracts or other text and 
accepting complete or partial translations. 


Benefits 

Benefits obtained from the use of the NLD wore measured with the least 
possible disruption to the indexing process. The hypothesis upon which the 
NLD was authorized was that the NLD would increase the indexers' 
productivity and reuse the indexing already done by DTIC. It was intended 
that the quality of the indexing would remain high. The following analyses 
shows that our samples were adequate, that the results are significant, and 
that we have proven our hypothesis. 

Evaluation Methods . The evaluation of the NLD was based on a 
comparison of the preliminary subject analysis study done for December 

1982 through March 1983 with a post-implementation study done for December 

1983 through March 1984. 

Study number 1 included confidential interviews with each indexer; all 
were conducted by the same interviewer to ensure consistency. In addition. 
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'rJS&'X s22j«t U T:nch7„V e fir Ct t e h d f ™a a Sl " s,e DTIC ™ tape, and the 
documents were taken from an ra t 0 the - e documents were analyzed. These 

there were fewer' than 100 cile the tape * S1nce 
some categories in approximate DroDortinn P ln Ctl0 1 s Were made from 
assigned to the more populous categories. ° 1 the number of documents 

observed ^imr^udie^^woufd be "intrusive ^nd's'l^ 6 the J e WaS concern that 

documents, drawn over a'fhV^-month ° f 15 ° ° TIC 

documents, twcTof the^TIC* ^ostVna t p lthou ? h stud ^ 1 had a sample of 100 
deviant leaving 0 ! sample s ize for' DTK oAf *\Z a$ being t0 ° 

some of the comparisons made. ’ ^ be ^°^ ow1n 9 table shows 



Study 1 (Pre) 

uTic IPSA 

Study 2 (Post) 
ETTT NASA 

N 

98 

100 

250 

250 

X 

14.32 

9.59 

13.09 

12.60 

7j x 2 





s =Vn^t 

2 4ii 

4.88 

2.03 

4.73 

2.05 


23.81 

4.11 

22.35 

4.19 

s 

.49 

.20 

.30 

.13 


Documents in sample 

Mean of term assigned 

Standard deviation where 
£ = the sum and X = 
the deviation from the 
mean 

Variance 

Standard error of mean 


the deviation of the ^st/mated' mean fronfth* 16 " t " t8St ^ used t0 ascertain 
comparative study (shown above) are valiS for'thV entire ' pop^lo*. ° f 0ur 

considerable 'Slff^nnce 1n° thTtveraVe dumber n “, s1 " 9 H the NLD> there »s 
the two agencies: 14.32 to 9 59 stnriv/ n ? Um * b t? r °^. 1ndex terms assigned by 
very close: 13.09 to 12.60 y Sh ° WS that the ava rages are no£ 

indexHTenns ass i gned ° to P a° document!' as^fndicated 56, 1n th the . number of NASA 
only signals increased productivity but the abo y e table » n °t 

access points to a document. TiULs^f SEE? 
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Figure 18 

A Comparison of OTIC and NASA Indexi 



indicated that these changes have not affected retrieval adversely. The 
pertinency level of retrieval has remained high throughout the introduction 
and use of the NLD. 

Time Savings . Two concerns in the field of library and information 
science are the ever-growing amounts of material to be classified, stored, 
and disseminated, and a constant need to do more work for less or the same 
amount of money. Information scientists are looking for ways to get 
information to the user more quickly. We feel that the NLD is making a 
contribution in this area. 

82% of the indexers reported that having index terms provided by the 
NLD makes indexing DTIC documents Taster. The remaining indexers indicated 
that having the suggested NLD terms has no effect on their speed. 

Indexers were asked to estimate the time saved by having NLD terms. 
The average was 5.4 minutes per document. See Figure 19. Indexers then 
were asked to estimate the time required to index a DTIC document with NLD 
terms provided. The average of these estimates was 10 minutes. When this 
figure was compared with the study 1 (pre-NLD) average indexing time of 13 
minutes, a 3 minute difference was noted. The predicted savings per 
document was 2 to 3 minutes. Based on the indexers' estimates, the 
intended goal has been reached and may have been exceeded. This time 
savings obviously speeds up the document turnaround time and can increase 
the timeliness of the product. 

Changes in Work Emphasis . As an indexer tool, the NLD has relieved 
the indexers of having to look up many terms in the thesaurus. The correct 
form is presented for use or for deletion. In place of this rather 
mechanical task, indexers are asked to watch for coordinations of DTIC 
terms that should be translated to single NASA terms. This process should 
result naturally from the review of the indexing terms presented by the NLD 
printout, and the change in emphasis can provide more challenge to the 
indexer's job than just looking up correct forms of terms. 

Shared Resources . Reindexing work that already has been done at 
taxpayer expense is wasteful of government resources. The original and 
primary purpose of the NLD was to utilize indexing done by other agencies. 
The sharing of indexing with DTIC also has brought about sharing of some 
programming and improved quality in the thesauri and lexical dictionaries 
of both DTIC and NASA. 

Stepping Stone . The Lexical Dictionary has been found to be a 
stepping stone to other endeavors. Its Phrase Matching capabilities are 
being expanded and will soon be used to add NASA terms to MARC records. 
The NLD is also a way of approach to machine-aided indexing of abstracts or 
other text. 

Problems 

Different Indexing Philosophies . Indexing philosophies differ from 
agency to agency. This difference must be addressed in translating 
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Estimate of Time Saved 
(7 indexers replied) 


3 minutes saved per document 
2 

10 

4 
3 
1 

15 

75 + 7 = 5.4 minutes average 
Estimate without the outliers 

3 
2 

4 

3 
1 

13 4 5 = 2.6 minutes 
Pooled estimate of time saved 

5.4 

2.6 

TTTff 4 2 = 4 minutes per document 

Estimate of time to index 
(10 indexers replied) 

6 minutes per document 
8 

7.5 
20 

4 

15 

7 

10 

12.5 

10 

100 4 10 = 10 minutes per document 


Figure 19 
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concepts. For example, one agency may index to "Ablative nose cones" and 
another to "Ablation" and "Nose cones". The first is precoordinated, i.e. 
the two concepts are joined in the index term. The second is 
post-coordinated, i.e. coordinated at the time of retrieval. 

Some agencies include in their indexing any broader terms that appear 
in the hierarchy of the term used. For example, if the term used is 
"Hafnium-alpha" the indexer or the system would assign all of the following 
terms: Hafnium, Refractory metals. Metals, and Elements. Another system 
would assign only the most specific term that applies. If the document 
were on the subject of Hafnium-alpha, they would index to that only, or if 
that term were not available, to Hafnium. 

These differences must be taken into account in setting up a lexical 
dictionary system. 

Chemical Compounds, Complexes, Metal Alloys, etc. . If you can't match 
a chemical compound or complex or an alloy term exactly, you may expect 
trouble with translations and retrieval. Coordinations of terms in this 
area of knowledge are likely to produce unwanted citations in retrieval. 
It is important to know how an indexer is instructed to handle these 
concepts. 

Semantics and Scope . The term "Performance tests" in DTIC's 
environment has no restrictions in meaning. In NASA's environment, this 
term applies only to operating equipment. Another agency has the term 
Blowouts and defines it as "high pressure. . .ejection of water, gas, or oil 
from a borehole"; in NASA's Thesaurus, Blowouts is related to Tires and 
Fatigue life. These are homonyms, two terms that match character for 
character, but convey different concepts. Every term must be examined as 
to its scope and meaning in both the input and target environments. 


Recommendations 

Automation . Automate the initial entries, the continuation entries, 
and use online editing. It will keep the manhours needed for data entry 
and for error correction to a minimum. 

Indexing Policies and Vocabularies . Become familiar with the indexing 
policies and vocabularies of both organizations: the one contributing, and 
your own, the target. 


SUMMARY 


Problem and Proposed Solution 


r . because the NASA STI Facility and the Defense Technical Information 

interests ’ the ^ share information. Twenty percent 
2L the 4 . NASA j> ase was Previously indexed by DTIC. Most of the 

documents rece.ved from DTIC are on microfiche accompanied by a magnetic 

• To avoid the reindexing that was necessarv to aHan+ 

information to the NASA system and y t0 «*>Pt the 

• To save indexing time. 


Construction of the NLD 


The NASA Lexical Dictionary was constructed in four phases. 

nf Phase 0ne center ed on constructing a file consistina 

of entries for every posting term and Use reference in the NASA Thesauri 9 

System Most^V th? 'Jn* references .constructed specifically for the NLD 
system. Most of the programs were written in this phase also The Phrace 

P rand' mT\TA H -tc&wEn n ? 

partial. For example: references. Matches may be complete or 


Input-Any Terms 
Gold 

Gold plate 
Gold plated 
Gold plated chassis 
Gold-plated chassis 


Output-NASA Terms 
Gold 

Gold coatings 

Gold coatings 

Gold coatings, chassis 

Chassis (because gold-plated, with 
a hyphen, has not been added to the 
file yet) 

terms that best express the same concept. For ex^mpU: ^ 0r NASA 


Input-DTIC Term s 
Anti Fogging Agents 


Output-NASA Tprms 
Fog Dispersal 
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Input-DTIC Terms 

Antioxidants 

Apogee 

Approach 

Architects 

Area Bombing 

Area Coverage 


Qutput-NASA Terms 
Antioxidants 
Apogees 
Approach+ 

Architecture, Personnel 
NIS (meaning Not In Scope) 

00 (meaning no equivalent concept) 


Subject Switching Coordinates . Phase Three completed the Subject 
Switching file by adding entries of coordinated DTIC terms translated to 
one or more NASA terms that express the same concept. For example: 


Input-DTIC Terms Qutput-NASA Terms 

Angles; resolution Angular resolution 

Antennas; Gravity waves Gravity wave antennas 

Feedback and Maintenance . Phase Four was concerned with user feedback 
and file maintenance. New terms added to either the DTIC Thesaurus or the 
NASA Thesaurus require additions and modifications to entries in the data 
files. In addition, users can supply feedback as to translations that 
should be added or modified. 


Results 

In the NLD, the NASA STI Facility has a system that translates words 
and phrases from input material into equivalent concepts expressed in NASA 
posting terms. The system was designed particularly to allow the reuse of 
DTIC indexing in the NASA environment. According to a study of 250 DTIC 
documents, 89 percent of the terms assigned to DTIC documents by NASA 
indexers now are suggested by the NLD. The Facility also saves an 
estimated 3 minute of indexing time per document. 

While translating DTIC's index terms to those of the NASA Thesaurus, 
the NLD has preserved the quality of NASA's indexing. Also we know of no 
other system that differentiates concepts that are expressed by homonyms, 
or that coordinates terms in one vocabulary and translates them to the same 
concept expressed in the different terms of another vocabulary. The NASA 
Lexical Dictionary system is not only operating, but it is doing so with 
considerable success. 
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GLOSSARY 


Access 

Routine 


appl ication 
program 


continuation 

entry 

continuation 

symbol 

control led 
vocabulary 


DOE 

DTIC 

element 


exception 

listing 

gloss 


key 


A general purpose computer program that accesses the NLD 
files. The Access Routine never operates independently; it 
is always called by an application program. 

A program that passes to the Access Routine two parameters: 

• a code that indicates whether Phrase Matching or 
Subject Switching mode should be employed, and 

• a character string that is either a word or phrase 
for Phrase Matching, or a set of posting terms 
assigned to a citation by a contributing source for 
Subject Switching. 

An entry that tells the computer to continue to look for 
additional key elements in order to reach the posting term. 

The one or more asterisks (*) or percent signs {%) used in 
the posting term field of continuation entries. 

Terms that are authorized by an organization for their 
indexers to use in listing the subject matter of, or the 
concepts contained in, a document; a list of posting terms 
acceptable to the system and available for use. 

Department of Energy 

Defense Technical Information Center 

An element is part of the key to a record. In the Phrase 
Matching file, each word of the input phrase is an individual 
element. In the Subject Switching file, each contributing 
source posting term (which may be single or multiple words) 
is an element. 

A list of input posting terms which cannot be matched and 
translated by the NLD. 

A parenthetical expression used to clarify the meaning of a 
posting term or Use reference that otherwise might be 
ambiguous. For example: 

LOX (oxygen) 

Pitch (inclination) 

Pitch (material) 

The subject of and a unique field in an NLD record. The key 
consists of terms that may be encountered in the input 
material. The key can consist of a single element, followed 
by a semicolon and two zeros (;00) or multiple elements 


76 


GLOSSARY (Continued) 


LDICT 

LEX. POSTTERM. - 
OTIC 

logic code 
major terms 

minor terms 


NASA 

NLD 

posting terms 
Phrase 

Matching file 


Phrase 

Matching mode 


separated by a semicolon (;). 
Lexical Dictionary 


A sequential file of DTIC posting terms. 


A one character code which indicates how the key is to be 
processed. 

Posting terms that, in the judgment of the indexer, express 
the major concepts and research areas of a document. Major 
terms may be used for online searching and retrieval or to 
generate printed indexes. See also Minor terms. 

Posting terms that, in the judgement of the indexer, indicate 
minor concepts and areas of interest in the information 
presented in a document. Minor terms encompass such aspects 
as properties, characteristics, action determined, relevant 
conditions of the investigation, measurement techniques, and 
instruments or calculations used when these aspects are not 
of primary importance. Minor terms do not appear in 
published subject indexes but may be used for online 
searching and retrieval. See also Major terms. 

National Aeronautics and Space Administration. 

NASA Lexical Dictionary. 

Controlled vocabulary terms that are used by an 
organization's indexers to index documents for the use of 
that organization. 

A file of NASA terms and Use references which are posted to 
valid NASA Thesaurus terms. The file is used as a general 
purpose translation table. This translation table accepts as 
input all of the posting terms and Use reference from the 
NASA Thesaurus as well as additional Use references 
constructed especially for the NLD. 

A general-purpose matching routine which attempts to find 
word-by-word matches between any input phrases and NASA 
posting terms or Use references. 

The Phrase Matching mode can be used to process any type of 
phrase input. This input can consist of document titles, 
freely assigned keywords, or posting terms from a 



GLOSSARY (Continued) 

contributing source Thesaurus for which Subject Switching entries have not 
yet been created. 


PMF 

SSF 

STI 


Subject 
Swi tching 
file 


Phrase Matching file. 

Subject Switching file. 

Scientific and Technical Information, as in the NASA STI 
Facility. 

A file of a contributing organization's authorized vocabulary 
which provides, in the posting term field, one or more NASA 
posting terms that express the same concept. An entry is 
created for every contributed posting term, but in some cases 
the translation may indicate that the term is out of scope or 
not able to be translated. Additional entries are created 
for combinations of input posting terms posted to one or more 
different NASA terms. 


Subject 
Swi tching 
mode 


A special purpose routine that translates the posting terms 
assigned to a document by a particular contributing source 
(such as DTIC) into NASA posting terms. Subject Switching 
translates the concepts represented by the posting terms, in 
contrast to Phrase Matching which looks only for word 
matches. A unique translation table, called a Subject 

Switching file, is built for the posting terms of each 
contributing source. 


Use reference A reference from a posting term that is not in the controlled 
vocabulary to one that is. For example: 


Condensation trails use Contrails 


Use for A posting term that is "used for" a term that is not in the 

reference controlled vocabulary. For example: 

Contrails use for Condensation trails 

VSAM Virtual Storage Access Method. VSAM records, stored on 

direct access devices, may have fields of fixed or variable 
length and may be processed directly or sequentially. 
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APPENDIX A 


PROCESSING OF INPUT WORDS AND PHRASES BY 
THE NLD ACCESS ROUTINE 

The Access Routine determines the logic code associated with each element in 
the input array from the appropriate data file. If an element is not located 
in the file, a logic code of u ?" is assigned for the use of Access Routine 
processing. The logic code controls the way the element will be processed by 
the Access Routine. A "T" logic code indicates that the Access Routine must 
look for combinations between that input element and other elements from the 
input array. Using "T" logic, a search key is created by adding to the "T" 
element a followed by the next element in the input array. The Access 
Routine then tries to match this search key with a key in the data file. 

• If the search key matches a file key which translates to a posting 
term, that posting term is returned. 

• If the search key matches a file key which contains a continuation 
character (*, **, etc.) in the posting term field, then the next 
element from the input array is added to the end of the search key. A 
mutch is again attempted with the file. 

• If no match is found for the search key, then the final element of the 
search key is replaced by the next element of the input array and a 
match is again attempted with the file. 


• If all of the elements from the input array are tried without finding a 
match and the lead element has not been usee in any other successful 
match, then a ,00 is added as the final element of the key for a 
final search attempt. 

The "T" logic processing is repeated with each "T" logic element from the 
input array as the first element of the search key. 

When the logic code of an element is anything other than a "T", the following 
logic is followed: 3 

• If the element has already been used in combination with a "T" logic 
element in a successfully matched search key, then that element is 
skipped. 


• If the element has not already been used in a successfully matched 
search key, the ";00" is added to the end of the element to create a 
search key that is matched against the file. 

The Access Routine returns all of the matches made on the input character 
string to the application program. 

The following example illustrates a simple case of Access Routine processing 
tor the Phrase Matching example given in the section on SYSTEM DESCRIPTION 

. r Lexical Diction ary Access Routine . Subject Switching processing is 

basically the same, but the Subject Switching input array would consist of 
contributing source posting terms sorted alphabetically. 
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Input Phrase: Engine endurance testing research laboratories 


Input Array 

Engine 

Endurance 

Testing 

Research 

Laboratories 


Logic Code 

T 

E 

T 

T 

E 


Phrase Matching File Entries for the Words in the Input Array: 


Logic Code 

E 

T 

T 

T 

T 

T 

E 

T 

T 

T 

T 

T 

T 

T 


Key 

Endurance ;00 

Engine;Control 

Engine;Design 

Engine;Testing 

Engine ; Testing; Laboratories 

Engine;Tests 

Laboratories;!)!] 

Research ;And 

Research ;And;u^. 'elopment 
Research; Facilities 
Research ;00 
Testing;Machines 
Testing;Time 
Testing;00 


Posting Term 

Endurance 
Engine Control 

Engine Design 
★ 

Engine Testing Laboratories 
Engine Tests 

Laboratories 

★ 

Research and Development 
Research Facilities 
Research 

Testing Machines 
Testing Time 
Tests 


Access Routine Processing: 


References are made to the Input Array and Lexical 


Dictionary Entries shown 


Processing Description 

Logic code of first element is T. 

Create search key from first two elements. 

Look search key "Engine;Endurance" up in file. 

Replace final element of search key with next 
element in array. Look search key "Engine; 
Testing" up in file. 

Add next element in the array to the 
search key. Look search key 
"Engine;Testir.g;Laboratories" up in file. 

Move on to next element in array. 

Logic code of second element is E. 

Second element has not been used 
in any T combination. 

Create search key from second element. 

Look search key "Endurance;00" up in file. 


Outcome 


Key not found. 


Key found. Continuation 
symbol returned. 

Key found. Posting term 
"Engine testing 
laboratories" returned 

Key found. Posting term 
"Endurance" returned 


A-2 


Processing Description 


Outcome 


Move on to next element in array. Key not found 

Logic code of third element is T. 

Create search key from third 
and fourth element. 

Look search key "Testing;Research" up in file. 

Replace final element of search key Key not found 

with next element in array. 

Look search key "Testing;Laboratories" 
up in file. 


Key not found. 


No more elements remain in array to be tried 
as a final element of the search key. 

"Testing" has already been used in a previous 
successful match. 

End processing for "Testing". 

Move onto next element in array. 

Logic code of fourth element is T. 

Create search key from the fourth and 
fifth elements. 

Look search key "Research ;Laboratories" 
up in file. 

No more elements remain in array to 
be tried as final element of search key. 

"Research" has not yet been used. 

Create search key be adding ":00" to 
"Research" 

Look search key "Research;00" up in file 

Move on to next element in array. 

Logic code of fifth element is E. 

"Laboratories" has already been used 
in a successful match. 

End processing for "Laboratories". 

No more elements in array. 

End processing. 

The final outcome of the processing is that the input phrase "Enqine 
endurance testing research iaberotories" is translated into the NASA posting 
terms Engine testing laboratories", "Endurance", and "Research". 


Search key found. 
Posting term "Research" 
returned. 
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APPENDIX B 


PROCEDURES FOR DETERMINING NASA TRANSLATION FOR DTIC 
TERMS FOR THE SUBJECT SWITCHING DATA FILE 


1.0 INTRODUCTION 

The DTIC/NASA Subject Switching capability requires that a file be 
created containing translations from DTIC posting terms to NASA posting 
terms. Because the creation of this file is a significant effort, 
automated methods have been employed to generate aids for the analysts. 
Lists of potential translations are obtained in the following way: DTIC 
terms are run through the NLD Phrase Matching file. The output is sorted 
alphabetically by DTIC terms in one printout and by NASA terms in 
another. NASA terms are run through the DTIC Lexical Dictionary (LDICT). 
This output is also sorted alphabetically by DTIC and by NASA terms. 

Frequent references to these printouts required new nomenclature for ease 
of communication. Since the printouts are divided into four separate 
volumes, they are referred to as Books and numbered in alphabetical 
order, as follows: 

DTIC/NASA sorted alphabetically by DTIC is Book 1. 

DTIC/NASA sorted alphabetically by NASA is Book 2. 

NASA/DTIC sorted alphabetically by DTIC is Book 3. 

NASA/DTIC sorted alphabetically by NASA is Book 4. 

In order to facilitate the use of the information contained in these four 
Books, additional programs re-sort portions of them and generate files of 
properly formatted candidate entries for the Subject Switching file. 
From the DTIC/NASA translations, separate files are created for: 

• ExtCt Matches - DTIC posting terms for which there are exactly 
matching NASA posting terms and 

• Partial Matches - DTIC posting terms for which there are one or 
more partial phrase matches with NASA posting terms or Use 
references. 

and a printout is created of all: 

• No Matches - DTIC terms for which there were no matches with NASA 
posting terms or Use references. 

From the NASA/DTIC translations a file is created for: 

• Tables - Two or more DTIC posting terms which, used in 
combination, translate to a NASA posting term or terms. 

The record for an entry contains a tentative logic code, a key consisting 
of one or more DTIC terms and a posting term field which consisted of one 
or more NASA posting terms. In the case of the exact matches and partial 
matches, the analysts edit the tentative NASA translations, which were 
generated by the NLD. In the suggested table entries, the analysts edit 
the keys which are tentative DTIC translations which were generated by 
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the DTIC LDICT. The no match printout is a listing, rather than a file, 
of DTIC terms for which the NLD had no matching entry, either complete or 
partial . 

Analysts study these potential entries and handle them according to the 
following set of guidelines covering both general procedures and 
procedures which are unique for each type of entry. 

2.0 GENERAL PROCEDURES 

2.1 RECORD FORMAT 

The record format for the computer generated candidate entries for the 
DTIC/NASA Subject Switching file is as follows: 

Logic code 

A one character code which is entered in the second column of the 
record. The logic code provides information for Access Routine 
processing and describes the relationship between the key and the 
posting terms. 

Logic codes may be one of the following: 

E The single term key and the posting term are equal. 

C There is a change between the single term key and the single term 
posting term. 

L The single term key translates to a list of posting terms. 

I The translation of the single term key is indeterminate and a 

choice is offered to the indexer. 

0 There is no NASA translation for the single term key. 

T There are multiple terms in the key. 

Key 

The key begins in column 4 and consists of one or more valid DTIC 
posting terms. If there is only one DTIC posting term in the key, it 
is followed by ";00." If there are multiple posting terms in the key 
they are separated by semicolons. The key is followed by a "$" which 
separates the key from the posting term. Parentheses are removed from 
any term in the key. 

Posting Term 

The posting term begins following the key and $. The posting term 
consists of one or more NASA posting terms in exactly the same format 
in which they appear in the NASA Thesaurus. If there are multiple NASA 
posting terms, they are separated by commas. 

Note: Entries which are manually coded for data entry using the online 

maintenance software follow the format given above with two exceptions. 
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E$Radar;00$Radar 

S-SHsas?' “ a?.ss :s,rs 

2.2 USE OF SYMBOLS 

t6rmS the 

covereWln^tW^TIcWtesaurus^ 6 T S h “"? S H ted te ™' ^ not 
narrower terms to see if thpv* a J he lndex ® r should look at the 
document in hand they are ap P ro P nate for use with the 

The suggested NASA term is an Array term 

The “» ‘‘eTeu a tSe r0a NASA C0 ; dePt tha " the DTIC **"• 

appropriate from the suggested terns ° r terms that are 


> 

? 


Special rules for symbols: 


1 . 


2 . 


bj en a “ ? " N I s “ s ® d » * n suggested NASA terms should be followed 

indexer choice entry. 1 " 3 " three terms should be suggesVd ini? 

than the DTI^ term . however tffe ^? e , sted NAsA term is broader 

"systems*"", t "equ i pment " ° r etc^ * 2 * 

subject. For example: ’ term for the general 


DTIC 


NASA 


Fire Control Equipment Fire Contrnl 

Adaptive Control Systems Adaptive Control 

suggested Vs aWraSslatW Wen V C ?2^ d1 " at< l d ,1st of N « A terms Is 
then the DTIC terms. * en if the coordinated terms are broader 


2.3 OTHER GENERAL GUIDELINES 


1 . 


2 . 


3. 


4 . 


5. 


6 . 


7. 


T^ e NASA Thesaurus does not contain an exact match, then the 
name of a discipline, the product of the discipline, or the 
instrument used in the discipline may be used interchangeably as 
translation. No ">" or "?" is addeo. For example: 9 y as 

Holography, Holograms 

If the NASA Thesaurus does not contain an exact match, then the 
noun form and the gerund form of a term may be used 
interchangeably. No ">" or "?" is added. For example: 

Couplers, Coupling 

If the source vocabulary has only one form of a term, and NASA 
has more than one, then all of the NASA variants should be 
presented followed by "?", as an indexer choice. For example: 

01 Estimates ;00$Estimates?, Estimating? 

Consistency should be maintained between similar switches. Check 
other entries that have been coded and try to follow the same 
pattern when a similar switch is encountered. For example: 

Brigade Level Organizations, Platoon Level Organization, etc. 

JL th ? h NASA Th . e . sauru l s does not contain an exact match, but does 

No">" Sr °U frfri J 6n r the 0 pp 0 V te iS used as the translation, 
no > or { is added. For example: 

Antijamming, Jamming 

° f i Ar . ray termS When possible » use Array terms 
rather than translating a term to "00". If three or fewer 

terms can be substituted for an Array term, do so. For example: 
NOT 0E Ba11ast;00$Ballast@ 

BUT 01 Bal last ;00$Bal last (Mass)?, Ballasts (Impedances)? 

Geographical terms should be translated according to the 
following rules. The rules are listed in order of priority: 

Rivers 


a. 

b. ~ 


c. 


d. 


Use the specific "River" term if it is available. 

""basin' 9 tp^rm \f D • aVai1 „ ab ] e 1 1 for the river » 11st both the 

choice. 1V6rS followed by "?" as an indexer 

hd- bhe river belongs in only one country or state list 
"Rivers" and the country or state term. ’ 

If the river is in the United States and belongs in more than 
one state, list "Rivers" and "United States." 9 ™ 
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e. If the river belongs in more than one country, list "Rivers" 
and the continent name. 

Islands 


a. Use the specific island term if it is available. 

b. If the island is part of a larger island group for which 
there is a term, use the broader group term. 

c. Use the "Islands" and the body of water in which the islands 
are located, unless a narrower combined term, such as 
"Pacific Islands" is available. 

d. Use the term "Islands" and the name of a country only if the 
island is both owned by and adjacent to the country. 

Cities, Towns, etc. 


a. Use the specific city term if it is available. 

b. Use the term "Cities" and the state or country in which the 
city is located. 


Seas 


a. Use the specific sea term if it is available. 

b. Use the term "Seas" and the country or continent in which the 
sea is located. 

c. If there is no single appropriate country or continent, use 
the term "Seas" with a>. 

8. Limit a list of posting terms to three terms - two, if possible. 

9. Post any term to 00 (zero zero) in preference to a poor 
translation or one of questionable accuracy. 

3.0 PROCEDURES FOR NO MATCHES 


No computer generated entries are available for No Matches. These 
entries are manually coded and entered using the online maintenance 
software. Because of this different entry method, an additional "$" is 
placed between the logic code and the key for these entries. 

1. Look for the DTIC term in the NASA Thesaurus, Volume 2 : Access 

Vocabulary. This will locate any NASA term that is a format 
variation of the DTIC term, such as singular or 

plural, term inversion, or hyphenated form. It also will locate 
any NASA terms of which the DTIC term is part. If a variant form 
of the DTIC term is found, assign a logic code of C and enter the 
NASA term in the posting field. For example: 

C$Abandonment;00$Escape (Abandonment) 

CSAcetones ;00$Acetone 
C$Acoustooptics;00$Acousto-optics 
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2. If no helpful information is found in the Access Vocabulary, then 
look up the DTIC term in the DTIC Thesaurus to determine if any 
broader terms are listed for that term. If there is a broader 
DTIC term, look up the broader term in Volume 1 of the NASA 
Thesaurus to see if that term exists. Examine the hierarchy of 
that term evaluating all narrower and related terms, for a 
possible translation of the original DTIC term. If the best 
translation of the DTIC term is a broader NASA term, assign a 
logic code of C and enter the NASA Broader Term followed by a 
greater than sign (>) in the posting term field. For example: 

C$Acetonitrile;00$Nitriles> 

3. If approaches 1 and 2 produce no translation for the DTIC term, 
check dictionaries for synonyms or related terms to the DTIC 
term, and look up these terms in Volume 1 of the NASA Thesaurus. 
If it is determined that two or more NASA terms are required to 
express the meaning of the DTIC term, assign a logic code of L 
and enter the NASA terms in the posting term field, separated by 
commas. For example: 

L-$Adaman-tanes-;00$Agents@, Curing 

4. If the DTIC term has two or more equally valid NASA term 
translations, assign a logic code of I and enter the possible 
NASA terms in the posting term field following each NASA term 
with a question mark and separating terms with a comma. For 
example: 

I$Automata;00$Automatic control ?, Automata theory? 

5. If no appropriate N/*iA translation can be found for the DTIC 
term, assign a log'iC code of 0 (zero) and enter NIS in the 
posting term field if the concept is Not In Scope for NASA, or 
enter 00 (zero zero) in the posting term field if the concept is 
in scope for NASA but no term is available to express the 
concept. For example: 

OSAttorneys ;00$NIS 
0$Autumn;00$00 

4.0 PROCEDURES FOR PARTIAL MATCHES 


Procedures for Partial Matches are the same as for No Matches, except 
that computer generated candidate entries suggest one or more NASA terms 
as a possible translation. These suggested NASA translations serve as a 
starting point for research in the NASA Thesaurus. Edit the posting term 
field of the computer generated entry by changing, adding, or deleting 
NASA posting terms based on research in the Thesaurus. Change the 
suggested logic code if it is required by changes made to the posting 
term field. 
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5.0 PROCEDURES FOR EXACT MATCHES 


1. Look up the Exact Match term in both the DTIC Thesaurus and the 

NASA Thesaurus, Volume 1. Check to see whether the NASA and DTIC 
terms appear to express the same concept by checking broader, 
narrower, and related terms as well as the scope notes. If the 

terms express the same concept, go on to procedure 2. If the 

terms do not express the same concept, determine whether or not 

there is another NASA term that does translate the DTIC term. If 

there is no better translation, go on to procedure 2. If a 
better translation is available, assign a logic code of HC and 
post to the different NASA term. 

2. Check both Thesauri to see if the NASA term has narrower terms 
that are not in the DTIC Vocabulary. If so, then place a plus 
sign (+) after the NASA term. This indicates that the indexer 
should consider the NASA narrower terms. 

HE Bolts;00$Bolts+ 

3. Look up the DTIC term in Book 3, NASA/DTIC sorted by DTIC, in 
order to see if multiple NASA terms are translated into the same 
DTIC term. If there are multiple, valid translations, then the 
following type of entry will be created: 

HI DTIC term$NASA term 1? .NASA term 2? 

to indicate that the given DTIC term could have been used to 
express either of the concepts represented by the NASA terms 
listed with a question mark. The indexer must select the 
appropriate NASA term. 

4. If the NASA term sometimes expresses the same concept as the DTIC 
term, and sometimes it does not, and if no better translation has 
been found, then assign a logic code of HI and enter the NASA 
term in the posting term field, followed by a question mark to 
indicate that the indexer must decide whether or not this term is 
appropriate for the document being indexed. 

HI Performance test's ;00$Performance tests? 

(The NASA term applies only to operating equipment.) 

5. If the DTIC and NASA terms express the same meaning, and if there 
is only one entry for the DTIC term in Book 3, then the logic 
code remains HE and the posting term will be the NASA term 
listed. 

HE Europe;00$Europe 

6.0 PROCEDURES FOR DTIC TERM COORDINATIONS (TABLES) 

1. DTIC terms in the key MUST be in alphabetical order. This is a 
requirement of the Access Routine. The term order within keys in 
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the computer generated entries will be correct. 

2 If the translation presented appears reasonable, it should be 
accepted exactly as it is. For example: 

J6T Abrasion;Resistance$Abrasion resistance 

, If other combinations of DTIC terms come to mind which provide 

eLlly «oSd or better translations of the same posting, term, 
code It or them for entry. Remember that the DTIC terms in the 
key MUST be in alphabetical order 

(jjj Abrasion;Wear resistanceSAbrasion resistance 

a Tf multinle Generic levels of the same concept apposr in a 

suggested P table, two entries should be made into the NLD. One 
will include the multiple levels of the concept, as suggested. 
The other will include only the most specific term. 

BT Acids '.Ascorbic acid;Metabol ism$Ascorbic acid metabolism 
add: \l Ascorbic acid;Metabol ismSAscorbic acid metabolism 

Note- Do not create tables with multilevel terms; only use 
multiple terms in the“same hierarchy in a table if they appear 
suggested entries. 

c If the broader generic term is the final term or if any Table is 
’ imbedded in another table, the key in the shorter entry must have 
;00 as the final segment. 

BT Acoustic waves;Excitation;Waves$Acoustic excitation 
bT Acoustic waves;Excitation;00$Acoustic excitation 

6 If more than two DTIC terms are Indicated in a suggested table, 
examine the terms for the most pertinent expression of the 
concept in the fewest possible terms. 

bT Aircraft rockets ;Fins folding fins;Rockets;Vehicies$Folding 
fin aircraft rocket vehicle 

Fins and rockets as individual terms in the key are unnecessary 
and should be eliminated. 

7. Check all DTIC terms in the suggested table for possible nar ^er 
terms that would be appropriate substitutions. If an appropriate 
substitution's found, code an additional, table using that term. 
Remember: DTIC terms in the key MUST be in alphabetical order. 

BT Atmospheres Spacecraft cabins$Spacecraft cabin atmospheres 
add: bT Controlled atmospheres Spacecraft cabms$Spacecraft cabin 
atmospheres 
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If a translation appears to be inappropriate, look for a 
substitute term or terms to convey the concept of the NASA term. 

bT Aircraft cabins;Similators;Spacecraft$Spacecraft cabin 
simulators 

Change to: bT Simulators Spacecraft cabinsSSpacecraft cabin 

simulators 

A b in the computer generated entry indicates that the entry came 
from a NASA Use reference. Look up t references in the NASA 
Thesaurus, Volume 1, to find the pertinent "Use For" (UF) 
reference. If the DTIC terms suggested are a reasonable 
translation of the UF reference, use the suggested table. If the 
terms are not a reasonable translation for that UF reference, 
delete the entry. 

bT Aircraft‘,Stars;Warning systems$EC-121 aircrafts 

The UF reference applicable is "Warning Star aircraft". Since 

this suggested table does not translate the concept of "Star 
aircraft" by "stars" and "aircraft", this entry should be 
deleted. 

10. Delete suggested tables posted to specific NASA terms for space 
vehicles, programs, projects, etc. which may be considered 
identifiers by DTIC, or which have no valid equivalent in DTIC's 
vocabulary. For example, delete: 

bT Antisubmarine ammunition;Engines -.Underwater rocketsS 
ASROC engine 

bT Artificial satellites;Biomedicine$BESS (satellite) 

11. When a suggested table indicates that DTIC expresses a NASA term 
for a chemical compound as two terms, consisting of a chemical 
element and a complex, add the word "compounds" to the element if 
DTIC has the element-compound term. 

Computer generated entries: bT Acetates ;Lead$Lead acetates 

bT Aluminum;Hydrides$Aluminum 
hydrides 

Code as: bT Acetates ;Lead compoundsSLead acetates 

bT Aluminum compounds;Hydrides$Aluminum hydrides 

12. A single key may be found going to different posting terms in 
nonadjacent entries on the computer printout. Watch for these 
and examine them carefully. If possible, choose one posting term 
and delete the other. If both posting terms, seem to be 
reasonable translations, list both NASA terms in one table, 
following each term with a question mark and separating them with 
a comma; then delete the other table. 

The automatically generated entries will appear as follows: 


0T Acoustics ;Stabi 1 i tySAcoustic instability 
0T Acoustics ;Nozzles$Acoustic nozzles 
0T Acoustics;Stability$Frequency stability 
0T Acoustics;Nozzles$Sonic nozzles 

The analyst's revision results in the following two entries to 
replace the four automatically generated entries: 

0T Acoustics ;Stabil i tySAcoustic instabil itv?, Frequency 
stability? 

j4T Acoustics ;Nozzles$Acous tic nozzles?, Sonic nozzles? 

13. When a printed table suggests another table which you wish to 
add, be sure that the DTIC and NASA concepts are the same. If 
necessary, add a second NASA term to achieve this. 

Computer generated entry: 

0 T Additives;Propellants$Propellant additives 

add entries: 


0T Additives ;Rocket propel lantsSPropellant additives, 
rocket propellants 

0T Add itives;Sol id propel lantsSPropellant additives, 

Solid propellants 

14. Don't construct a table going to a list of NASA terms if the 
separate elements of the key already are or should be posted 
individually to the same terms you intend to list. 

Examples of tables which should not be constructed: 

0T Machinery; Performance testsSMachinery, Performance 
tests 


because DTIC's term "Machinery" 
and DTIC's term "Performance 
"Performance tests" (followed by 
which should not be constructed 


goes to NASA's term "Machinery" 
tests" goes to NASA's term 
a question mark). Another table 
is: 


0 T Aire raft; Composite structures ;Construction$Aircraft 
structures, Composite structures 


because DTIC terms Aircraft and 'Construction" translate to the 
NASA term "Aircraft structures" while "Composite structures" 
translates to "Composite structures". 
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